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On an Impulsive Control of Additive Processes 


Hideo Nagai 


Department of Mathematics, Tokyo Metropolitan University 
Fukasawa, Setagaya, Tokyo 158 


Let (Q,2,2,,P.,X,,0,) be a temporally homogeneous Lévy Process on R" 
(spatially homogeneous Hunt process on R"). We assume that the process is of 
function space type and has a reference measure (cf. Appendix and [2]). Take a 
sequence of &,-stopping times {t,}, such that 


% St, for each k (0.1) 
where <holds unless t, =, and 


limt,= © as. (0.2) 


k-0o 


and a sequence of [0,©)"-valued random variables {€,}, such that each €, is 
B,,-measurable. We call the sequence v={t,,¢,;72,¢ ;...} of such pairs {t,,€,} 
an impulsive control and denote by V the class of all impulsive controls. Put 


¥°(o)=X,o)+ LY &(o) (0.3) 


ti(@) St 


oo 


u(x)= inf E, [fe f(¥(w))dt+ 
veV 0 


oo k«é)| (0.4) 
i=1 
where f and k are given nonnegative functions on R” and [0,°)” respectively. 
We are concerned with finding an optimal control veV in the above variational 
problem (0.4). A similar problem has been considered by A. Bensoussan, J.L. 
Lions [1]. They have shown the relation between some quasi-variational in- 
equality and the impulsive control problem. More precisely, under the assump- 
tion that the solution of the quasi-variational inequality satisfies some regularity 
conditions, they proved that this solution turns out to be the minimal cost 
function corresponding to (0.4) and moreover that there exists an optimal 
control. But they did not discuss under what conditions their regularity assump- 
tions are satisfied. 
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In this paper, we first study the (uniform) continuity of the function u(x) 
given by (0.4) and then deduce the following properties of u(x): 


u(x)SE, [fe f(X)dr+e*u(X,)| (0.5) 
0 


for each stopping time t, and 

u(x)S inf {k(é)+u(x+)}. (0.6) 

E20 
Here €20 means that each component of € is nonnegative. Inequalities (0.5) and 
(0.6) express some strategical meanings. By making use of these inequalities, it 
becomes clear that the above impulsive control problem is closely related to 
the following optimal stopping problem: 
w(x) =infE, | fe-"f(X,)dt+e-™ M(X)| (0.7) 

t 0 

where 


M,,(x)= inf {k(€)+u(x+¢)} (0.8) 
é20 


and t ranges over all &,-stopping times (Theorem 1). Moreover we note that the 
minimal cost function u(x) is the largest of functions satisfying inequalities (0.5), 
(0.6) and some regularity conditions (cf. Remark at the end of §3). Finally we 
show the existence of an optimal control as the consequence of the above 
observations (Theorem 2). 


Now we state our result. Under the assumptions 


(A.1) & is a positive constant, 

(A.2) f(x) is a uniformly continuous bounded nonnegative function, and 

(A.3) k(&é) is nonnegative Borel function defined on [0,00 )" such that 
k(E,)+k(E,)—k(E,+&,)2c>0 for each €,,£,€[0,00)" 


where c is a positive constant, we have the following: 

Theorem 1. u(x)=w(x). 

Theorem 2. If there exists a Borel function €(x) such that 
SEER SEES: See Eee ee Ee 

then the control 6={7,,€,; t,,€,;...} defined below is optimal: 


ae” X,€D} 
E,(o)=2(X,,), 


k-1 
t,(w) = inf {124_.(0):X(o+ Y Eo)eD 


k-1 
E(a)=é (x.(0)+ : ¥ (o)), k=2,3,4,..., 
1 


where D={x;u(x)= inf {k(€)+u(x+ €)}}. 
g20 
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Remark. If k(&) is continuous and tends to infinity as |é| +00, then there exists a 
Borel function €(x) satisfying the assumption in Theorem 2 because the infimum 
of the right-hand side of (0.9) is attained at some €. 


§1. Proof of Theorem 1 (I) (u(x) < w(x)) 


For a &,-stopping time t and R"-valued random variable €, let 6° be the 
transformation of Q defined by 


X (83 @) = X 45), (@) + E(@). (1.1) 


We note that if € is ,-measurable, then (0:)-' @, cB, ,, espeacially, if t=0 and 
€ is By-measurable, then (6°)~' Z,=(03)-' B,<Z, (cf. Appendix). Further we 
note that if € is @-measurable, then (0°)—' F,c@Z, for each ts, where F, 
=0(X,,uSt). 

Now we prepare three lemmas for the proof of inequality u(x) < w(x). 


Lemma 1.1. u(x) is a bounded uniformly continuous function. 
Proof. (uniform continuity). By the definition of u(x), for each e>0 there exists a 


control V(e)={tj,€};75,¢5;...} such that 


u(x)2E, [fe f(yeaee ¥ e*ikG)| = (1.2) 
0 i=1 


We define another control #(e) = {#, & ; 7%, &;...} as follows; 
E(w)=t2(0? wm) n=1,2,3,... 
E(w)=E(0' mw) n=1,2,3,... 
where d€R". It is easily seen that i(e)eV (cf. Appendix). Then 
YO W)=Xo)+ YL Eo) 
t(@) Ss 
=X(0a)-5+ YF &(0’a)=Y(0? w)—5 


t{(0°@)<s 


Lek Ei) = Le ki a). 


Therefore for each yeR" 


E,| fem pysmyacs Lek E | 


—Eyss[ lems (nr) de+ Deke) | 





se, [fe 110476 o)—3)- f1r0(0* oy} dt] 


0 


+ , [fe F(X? de -E, 4; [fe foeo)dt] 


+lE, [seme Reo” o)| ~E, sf De-* Keo) 


1 
S_ sup|f(z—9)—f2- 


Here we made use of spatial homogeneity of the process (Q,4, B,, X,, P.,0,). 
Since f(x) is uniformly continuous, there exists 6(¢)>0 such that 


E, [fe-retyarr yet KE 
0 i 
ails [fe f(¥)dt+ Fe-* kED| <e 
0 i 


for each n with |y|<d(e). Take yeR" such that |x — y|<6(e). We can assume that 
u(x)Su(y). Then 


E,| fem peymyar+ Lethe) |-esuc'suo) 
0 i 


se, [fem seyyde+ Sek] 
0 i 


Therefore 
|u(x)—u(y)|<2e. 


Boundedness of u(x) is clear because the control such that t,, = 00 (n=1,2,...) 
is included in V and (A.1)~(A.3) are assumed. 


Lemma 1.2. u(x) SE, [fe Sf (X)dt+e-* u(X a for each stopping time t. 
0 


Proof. For each 6>0 there exist sequences {x,},, x,¢R" and {U, (6)},, where 
U,,(6) is a neighborhood of x, such that U U,,(6)=R", |y—x,|<6 for each 


yeU,,(6) and U,,(5)OU,,(6)=¢ if k+j. We can take a sequence of controls 
{v(n, )}, = {{tt*, €P°; ty, ...$}, CV such that 


E,. [fesree) dt+ 3 "tie Ker —eSu(x,), 
9 i 


n=1,2,...,for each e>0. 


=" *(w) ~ t8(Gre— Xoo w) 


Ere (o) = Ep (P"—* wv), 
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Then i(n, e)={z*, &*; ty, 25°; ...}eV (see Appendix). Since P, (tr*=zn', Em 
=¢* for all i)=1 it follows that 


E. [Se-nrarrmnnaes Deke 9| 
° i 
~By [Sescrrmoydrs eet Ke9| ne 
° i 


for each xeU, (6), with |5|<d(e) in the same way as the proof of Lemma 1.1. 
Here 6(e) is a positive number such that 


“if)- FO) <e holds whenever |x — y|<6(e). 


u™*(x)=E, [Femrermnndr ty eK]. 
0 i 


Then 
ju(x,,)—u™*(x,,)| <é€ 


and 
\u™*(x,)—u™*(y)|<e for each yeU, (6). 


Moreover |u(x,,) —u(y)|<e by Lemma 1.1, and therefore for each stopping time t, 


E,[e~**u(X )] 


E,[e~**u"*(X ); X,€U, (6)] 


E,[e~*u"*(X,); X,eU,,(6)]—3¢ 


1 


x E, be Ey. [ fe-erermeyas 
0 


n=1 
+Deror "ke |; X,€ U,,(3) |-3e 
Let v(e)={t;, 64; 75,65; ...} be a control such that 
™%}(w)=t(w) +7 °(0,@) if X,(w)eU,,, (0), 


and 
E(o)=E"(0,0) if X (w)eU,, (5), 
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k=1,2,...;n=1,2,.... Then v(e)eV (see Appendix). When X ,(w)eU, (6), 


¥°(o)=X,(o)+ Y Eo) 


té(@) Ss 


= X,(@)+ » E*(0,0) 


(0) +2 *(0-@) Ss 


=X,_0.0)+ PY &*(0,a) 
et £(0.@)Ss—t 


= ¥.5¢9(0, 00) 


for each s>t and 


Ye thE) = Le sete (E(B, co) 


=e eH eo) k(E*(9, 2). 
Therefore 
E, | fe"s(X)dt+e-“w(X,)| 
0 


>E, I! ent fy,2() at| 


+ ¥ y[ew ey, [fenesarmads+ Lethe 9]: X,e0,(6)] —3¢ 


=E,|fesorreyat] 
- : E, [5 e859) £(¥P"9(0, w)) ds 
+ Zensen won (E000); X,€U,,(8) | —3e 
=E, [ e"f (Y"°) de] +E, ey (Y2)dst+ Y e-*% KE| —3e 


=E,[fe-scyryar+ Y e*tktg9 | —3e 
2u(x)—3e. | 
Since ¢ is arbitrary our lemma has been proved. 
Lemma 1.3. u(x)< baal {2} +0 +-2)}. 


Proof. Let veV be an arbitrary control and define another control 6 
= {T,,¢,;T2,¢,;...} as follows: 


t,(o)=0, &,(@)=z 


T(@)=1;_ (Fo), E(@)=é,_ (Fw), i=2,3,4,... 
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where ze[0,0o)". Then 
Y¥(@)=X,(o)+z+ Yo E(F w)=Y(F o). 
1;(07@) Ss 


Therefore 


E,[fe-“sards+ Zeek] 


=k(2)+E, [ie (YF w))ds +¥ e- 74") (EF o)| 
0 i 


=k(z)+E,,. [fe reinds+ De ate). 


So we have 
u(x) Sk(z)+E,,,. [fe fapnds+ Demag) 
0 i 


Since the left hand side does not depend on z and v it hold that 


u(x) < inf {k(z)+u(x+z)}. 
z20 


Now we can easily prove that u(x)<w(x) as the consequence of Lemma 1.2 
and Lemma 1.3. We have 


u(x) SE, [femsrexvae +e7** u(x, 
0 


<E, [fe f(X)dt+e-™ M,(X)| 


for each stopping time t. So we have u(x)< w(x). 


§ 2. Some Lemmas 


In this section we prepare some lemmas to complete the proof of Theorem 1 and 
Theorem 2. 


Lemma 2.1. M_,(x) is a bounded uniformly continuous function. 
Proof. For each ¢>0 we can take 6(e) such that 
ju(x+z)—u(y+z)|<é 


for |x—y|<6(e) and for each z by uniform continuity of u(x). We can assume 
that 


M,(y)SM,(x). 
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By definition of M,(y), there exists €, such that 
M,(y)2kK(E)+ul(y+e)—e 
for each ¢>0. Therefore 
Ik(E,) +u(x+€,)—{k(E,) + uy t &)} Slu(xt &)—uly +E) <e 
for |x — y|<6(e). Hence 
IM,(x)—M,(y)|<2e for |x—y|<6(e). 
Boundedness of M,,(x) is clear. 


Lemma 2.2. G, f (x)— w(x) is an a-excessive function and w(x) can be written 


w(x)=E, [i e~* f(X,)dt+e-* M,(X,,)| (2.1) 
0 


where tp=inf{t20; X,eD}, D={x; w(x)=M,,(x)}. 
Proof. By the definition of w(x) and the strong Markov property, 
G,f (x)—w(x)=sup E,[e~**{G,f(X,)—M,(X,)}]. 


We note that our process (Q, Z, B,, P., X,,0,) is Hunt and has a Feller transition 
semi-group. So, by [3] (III. §3 Theorem 1) it is known that G, f(x)— w(x) is the 
smallest excessive majorant of G,f(x)—M,(x). Moreover, since G,f(x)—M,(x) 
is bounded continuous we can easily show (2.1) by making use of Lemma 10 
({3], 111 §4) and Lemma 3 ([3], III § 2). 


Lemma 2.3. 
w(x) inf {k(€)+w(x+)}. (2.2) 
é20 


Proof. It is easily seen that (2.2) follows from the definition of w(x) and the 
inequality u(x) < w(x): 


w(x) SM,(x)= inf {k(¢)+u(x + ¢)} S inf {k(C) + w(x + ¢)}. 
620 é20 


Lemma 2.4. Let g be an F-measurable random variable’, t be a stopping time and 
€ be a B,-measurable R" valued random vector. Then 


E,[g(6@ ); B]=E, (Ex, , [¢(@)]; B) (2.3) 
for each Be&.. 


Proof. Noting that our process has a Feller transition semi-group, we have only 
to prove (2.3) in case that 


g(@)=2,(X,,(@)) g2(X,,(@)).-- 8m(X,,,(@)), 
OSt, <t,<...<t,, g,€C,(R"), k=1,2,...,m 


' Here we prove this lemma for an ¥-measurable r.v. g, but we can extend it to a @-measurable 


r.v. g (cf. [2] Chap. 1 §5). 
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and 


N 
e(@)= > Ck T,,(@) 
k=1 


[2"c]+1 


where [,¢B, are disjoint and ¢,¢R" are constant vectors. t, = an 


. Then 
T, 7 as nO, and 


E,[g(63 @); B] 


= E,(81(X,,42+6)---Sm(X,,42+ 6); BY 
= lim E,[81(X,,42,+0)---8m(X42, +23 BY 


n— Oo 


= lim > E xLg1(X,, +b +h. Sm(X +f +); 


n—0o j= 1 


Boy se <<7)| 


= lim y ¥ EL, +L toy. Bm(X, mot oe)s 


n—0O j=1 k=1 
' j 
Bohol Ss on iy 


=lim 5 Y Eldon [g1(X,,)---8m(X,,]5 


nO j=l k=1 


j 
< 
Bokoy > seed 


= lim > E Ey i i, g81(X1,)---8m(X 15 


nC j=1 


oS al 
Boys sel} 


= lim E,[Ey, 5 <[81(X,,)---8m(X,,,)15 B] 


n—0o 


= E,[Ey.,-[g()]; B], 
because our process is spatially homogeneous and has a Feller transition semi- 
group. 
Lemma 2.5. Q(x)=G, f (x)—w(x) is a regular «-potential (cf. [2]). 


Proof. Since it is known by Lemma 2.2 that G, f(x)—w/(x) is a-excessive, we have 
only to prove 


lim E,[e~*"" Q(X7,)]=E,[e-*7 O(X;)] 


noo 
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where {7,} is any sequence of stopping times such that T,7T as noo. By 
Lemma 2.2 and the strong Markov property we have 


O(x)=G,f (x)— w(x) 
=E,[e-"?{G,f(X.)— M,(X,,)}] SE, Le-"? K(X,,)]- 


Therefore 


E,[e~*™ Q(X 7,)]=E,[e-*" Ex, [e-* K(X,,)1] 


=E,[e~%t tr,» K(X 7, ..p@7,0)()]- 


T,, +t p(07, o)=inf{t2 T,; X,¢D} finf{t2 T; X,eD} 
=T+1)(0,), 


it holds that 


lim E,[e~*7**°T K(X 7, , 5 07,0)())] 


no 


- E, [e —a(T+tp(@ro)) K(X rT. tp@ro)())] 


=E,[e~*" Ex, [e~* K(X,, = E,[e~*" Q(X ,)]. 


Here we made use of boundedness and continuity of K(x) and the quasi-left 
continuity of our process. 


§3. Proof of Theorem 1 (II) (w(x) <u(x), 


Now we can complete the proof of Theorem 1. 
Since G, f (x)— w(x) is a regular «-potential by Lemma 2.5, there exists a non- 
negative continuous additive functional? such that 


G,f(x)—w(x)=E, [fe aa,| 


(cf. [2]). Put 
m(o= ¥ E(o), 
j= 


E,[e~* w(¥°(t;—))] 


=E [eo JE [Je f(X)ar| 


—Ev%,-) [fe aa} 


This additive functional is perfect by the assumption that our process has a reference measure 
(cf. [2] Chap. V Theorem 2.1). 


2 
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oo 
-s _ tee [ fen sxpar]| 
0 
ee 
-£, fe" Exon a[le“a] 


=E. le a £(X,(on- w)dt| 


co 


~§, ee fendA(or-' w)| 
0 


ti-1 


=E. [fe FXO" w)dr| ik [ 


Ti Tt 


{ e-*dA(on- o)|. 


1 


Similarly 


E,[e*"* w(¥°(t;_\))] 


“a3 


3 


e-**f(X,(0"-1 @))dt— | e-*dA, (ON: o)| 
0 


ti-1 


Therefore 
E,[e~** w(Y"(t;—))]—E, [e-**"* w(¥(t;_,))] 


=E,|- { en F(X (Oreo) drt fee dA On-10)]. 
0 


ti-1 


E,Le-* w(¥"(r,—))—e-2- w(¥"(,_))] 


2-8, [ {esx or opdt]=-£,[ | e“sornae], 


Ti-1 imi 


By Blumenthal’s 0—1 law we know that P(t, =0)=0 or 1. In the case that P(t, 
=0)=0, putting t, =0 we have 


{E,[e~*" w(¥°(t,;—))J—E, Le" * w(¥"(z;_ 1} 


i=1 
=~ [fem repar] 


Therefore 


n—1 
—E,[w(¥o)]+ ¥ E,Leo** {w(¥"(t,—)) — w(¥"(c))}] + E, Le 7" w(¥"(t,—))) 
i=1 


i= 


2—£,[fe-“sarnae], 
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On the other hand we know by Lemma 2.3 
w(Y°(t;—))S inf {k(z)+w(Y¥°(t;—)+2z)} 
z20 
Sk(C)+w(¥"(z))) 
and 
E,[w(¥)]=w(x) 


by our assumption. So we have 


oo 


wos, [fem snares ek(E)|+EsLe* w(¥" (5D) 
0 1 


Since 1, > 00 as nO, it holds that 


woE,| fe" fyydr+ yee) 
0 i= 


for each ve V. Hence we obtain w(x) <u(x). 
In the other case that P.(t, =0)=1 we have 


E,[w(¥°(t, —))J=E,[w(X o)] =w(>). 
Therefore it holds that 


n—1 


E,L0(¥"(e—)— WOE HE, Bele tw(¥—)— WMD] 
2 w(x)—E, [fe-rorar] —E,[e~*" w(Y"(t,, —))]. 
0 


We can then obtain w(x)<u(x) in the same way as above. 


Remark. Put E={g(x);G,f(x)—g(x) is a bounded a-regular potential and 
g(x)<M,(x)}. Then by the above proof of the inequality w(x)<u(x) we can see 
u(x) is the largest of functions belonging to E. 


§ 4. Proof of Theorem 2 
First we will prove that 
u(Y°(z,))SM,(Y¥°(z,))—c for each n 
where c is the constant in the assumption (A.3). From the equality 


u(Y°(¢,)) +k(é,)= inf {u(Y°(t,—)+z)+k(z)} 
it follows that ’ : 
u(Y°(E,)) Su(¥°(¢,—) + &, +2) +k(E,+2)—k() 
Su(Y°(z,)+z)+k(z)—c 


for each ze[0, 00)". 
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Therefore (4.1) holds and we know 


Y°(z,)eD° 
for each n. 
As the consequences of (4.1) we can prove the next lemma. 


Lemma 4.1. P.(lim ¢,=00)=1 for each n. 


n—0o 


Proof. For each e>0, there exists 6>0 such that 
|u(x)—u(y)|<e and |M,(x)—M,(y)|<e 
hold whenever |x — y|<6. Then from (4.1) it follows that 


u(Y°(¢,)+x)-eSM,(Y°(t,)+x)+e—c, for |x|<6. 
Take &<5 and fix 6 satisfying (4.2) for such ¢. Then 


u(Y°(t,)+x)<M,(Y°(¢,)+x) for |x|<6. 
Therefore ; 
Y°(t,)+xeD° for |x|<6. 
Now we define a stopping time o=inf{t=0; |X,—X,|>6}. Then 


Ty 1(@)=T,(@) + Ty: (a Gin) y) 


2T,(w@)+o(0,, @). 


B,[e*1SE fe) 
=E,[e~*-'E, [e“ll. 
Since E,[e~’]=h is a constant we obtain 


E,[e~*]<h". 
Then 
lim E,.[e~*"]=0. 


noo 


Therefore 
P.(lim ft, =00)=1. 
Now we can give the proof of Theorem 2. 
Since t)(w)=t(w)+7p(0,@) for each stopping time t<Tp, 


t(@) + tp(@-@) 


wex)=E, | f ef (X dt e200) MX. ,05(0,0))| 
0 


E.| e~* f(X,)dt+e-* Ey. [fessexyarte MX,,)]] 


E,| 


e~* {(X,)dt+e-** wx). 
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Putting t=Tp, 
u(x)=E, [ fe-"s(X,dt+e-*” u(X,,)}; 
0 


because u(x)=w(x). Therefore 
u(x)=E, [fenmr Y*)dt+e-** u(Y°(t, — »| (6.3) 
0 


On the other hand X, = Y°(¢, —)eD because D is a closed set. Therefore by our 
assumption 

w(¥*(@, —=K(E(Y(E, —)) +ul¥(#, -) + (VCE, —) in 

=k(,)+u(Y*(é,)). 
u(x)=E, [fe-seydr+ene Ké)| 
0 
+E, [e~**u(Y*(é,))]. 
Now 
E,[e~**u(¥°(z,))] 
=E, |e Eys ga | fe" F(X) dt re"? w(K, 


0 


HE, Jet Ey, .4,| Lets dtte-*u(X,,)] 


0 


(6.6) 


ma) ; ' ‘ 
ne, [e“} f efx (Ohzapdee--eid WX sole HO] 
0 


=E, | T es (¥)dt+e-a u(Y°(t,., -| 


tn 


for each n. Then 


u(x)=E,| fe-*f(Y)dt-+e* KE) +e“ (YE, —)| 
0 


by virtue of (6.6) for n=1. Hence we obtain 


u(x)=E -[few sae ydt+ y et k(Eyte-%u( Y(t »| 


i=1 


by induction. Lemma 4.1 now completes the proof. 


Appendix 


Let F be the family of all probability measures on R". We denote by F" (resp. 
F) the completion of F (resp. completion of F, in F“) with respect to P,, ueP, 
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where F =o(X,,0St) and F7,=0(X,,s<t). Our process (Q,B, B,, P., X,,9,) is of 
function space type, that is Q=D([0,co)»R"), B= ie and &,= () F, 


ue? 
where D([0,00)~»R") is the space of all right cenlbiniaaias functions on [0,0) 
with lefthand limits. 


We will derive a lemma concerning measurability of the transformation 65 


defined by (1.1). 
Lemma. Let t ie a B,-stopping time and € be a @,-measurable random vector. 


Then (6°)-' B,cB,.,, for each s>0. 
Proof. First we note that ()-' Zc@B,,,. Indeed {Be F;; (0)-' BEB, .} is a o- 


algebra and contains any cylinder set belonging to %. Therefore we have only 


to prove that for any probability measure ye there exists another probability 
measure ve# such that P.o(62)~-'=P, according to Lemma5.6 of Chapter! in 
[2]. It follows from Lemma 2.4 that 


F,(;)~* B)=E, (1,4; 0)] 
=E, (Ey .Up()]] 
=f PR(B) P(X,+£éedy) for each BeZ,. 
Put v(dy)=P,(X,+éedy). Then 
P,((6:)~* B)=P(B). q.e.d. 


Now we will make some remarks. 


Let v={t,,¢,372,2;---} be an impulsive control and 7 be a By-measurable 
random vector. Put 


T(@)=7,(0" o) 
E (a) =&,(0" o). 
Then s={7,,2,;7,,2,;...} is also an impulsive control. For it holds that 
{w; 7(@)<t} =(0")—! {t,(@)<t}eB,,t>0 
and 
{o; &(@)EB} 0 {@; 7;(@)<t} 
=[(0")* {E,(@)eB}] O[(6")~* {t,(@) <t}] 
=(0")~* [{E(o)eB} 9 {t,(@)<t} Je, 


for each BE &(R") by the above lemma. 


Next let v"={ui, 7; “3,13, ---} be a sequence of impulsive controls and t be 
a stopping time. Put 


fi(@)=t(w)+y}(0,@) if X (w)eU,, (6) 
ni (@)=n7 (0, w) if X,(w)eU, (5), 


where U, (6) is a neighborhood of x, introduced in the proof of Lemma 1.2. Then 
we can prove that 5={{1,,7,; 12,2; ...} is also an impulsive control. Since T;(@) 
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=t+/(0,«) is a stopping time and @,c &,, for each n and i, 


{ii,(@)<t} =) [{X (o)eU,, (5)} 0 {7} <t}] eB, 


for each i. Therefore we have only to prove that 
{j,(@)EB} 0 {E;(@) <t} 
=) [{X,€U,,(5)} 0 {nj}, o)eB} 0 {t +n} (8, @) <t}] Ee, 


for each i and Be @(R"). It holds that 
{X EU, (5)} 0 {n} (8, o)EB} 0 {t+ 70, @) <t} 
= ) [{X,€U,,6)} 0 {ni0,0)eB} {u7O, 0) <s} 0 {t<t-s}] 


Oss<t 
rational 


= VU) [X,cU,, 0} 0 {t<t—s} 67 *({neB} 0 {ut <s})] 
Oss<t 
rational 


{X EU, (5)} N{t<t—s}EB,_,<B, 
and 
{t+s<t}n0>'[{nieB} cn {uj <s}JeB, 
because 
O>* [{nfeB} 0 {ui <s}JeB, ,,. 


Hence we get our conclusion. 
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1. Introduction 


Let F denote the convolution semigroup of probability distributions (the 
operation is the convolution, denoted by *, corresponding to the addition of 
independent random variables). The expectation, denoted by E(F), satisfies 


E(F, * F,)=E(F,) + E(F,) 


whenever the expectations on the right-hand side are finite, that is, it is a partial 
homomorphism. Our aim is to derive the following 


Theorem. There exists a homomorphism o: F¥—>R* (R* denotes the additive 


group of real numbers) such that o(F)=E(F) for every F having a finite 
expectation. 


The difficulty is caused by the fact that F is not cancellative, ie. F*F, 
=Fx*F, does not imply F,=F, (the Khin¢éin phenomenon, see e.g. Feller [2], 
p. 479). In Sect. 2 we develop an algebraical method to eliminate this problem, in 
Sect. 3 we apply it to our case and in Sect. 4 we state several further results and 
open problems. 


2. An Algebraical Lemma 


Lemma. Let S be a commutative semigroup, S' a subsemigroup of S, G a group and 
Ww: S'+G a homomorphism. For the existence of a homomorphism go: S>G 
satisfying p(s)=wW(s) for every seS’ the following condition is necessary: 


ss,=ss, implies w (s,)=W(s2) 
if seS,s,,s,€S’. (1) 
If G is divisible, then (1) is also sufficient. 


This may be well-known, but we cannot give any reference. 
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Proof. The necessity of (1) is evident. To prove sufficiency write s,~s, for s,, 
s,€S if there is an seS such that ss, =ss,. ~ is evidently a congruence relation, 
and the factor-semigroup S,=S/~ is easily seen to be cancellative. Let «: S-S, 
denote the canonical homomorphism. Our condition (1) just means that yw is 
compatible with ~, and hence it induces a homomorphism /,: Sy—G, where So 
=a(S’), such that p=yyou. 

S,, being cancellative, can be embedded into a group H and wy, can be 
extended to a homomorphism w,: H’>G, where H’ is the subgroup of H 
generated by S). As G is divisible, by a theorem of Baer [1] (see also Fuchs [3], 
Chap.IV. §21) this homomorphism can be extended to a homomorphism 
~,: HG, and now we can set p=@,°4. 


3. Proof of the Theorem 


By the Lemma we have to show only that F, *F = F,*F implies E(F,)=E(F,) for 
F, F,, F,¢-¥%. Denote the characteristic functions of F, F,, F, by f, fi, fa 
respectively. As the expectations are finite, we have 


E(F)= —iff0) (G=1,2). (2) 


F*F,=F*F, implies ff,=ff,. f is continuous and assumes the value 1 at 0, 
hence we have f,(t)= f(t) for sufficiently small values of t, and therefore 


f{0)= f2(0). (3) 
(2) and (3) imply E(F,)=E(F,). 


4. Problems and Results 


1) It is natural to ask whether such an extension can have any “good” 
properties. It is easy to see that it cannot be monotonic; however, we proved 
that it may have the weaker property of being positive for positive variables and 
negative for negative ones. 

2) The additivity of expectation is not restricted to independent variables. 
Can a real number ¢(€) be assigned to every random variable € being additive 
(ie. satisfying o(€,+¢,)=(E,)+(€,) for all €, and €,), depending only on the 
distribution of € and coinciding with the expectation if it exists? We conjecture 
that the answer is negative. 

3) Besides its additivity, for independent variables the expectation is also 
multiplicative. Can an extension have this property as well? That is, together 
with the additive convolution * we may also consider the multiplicative con- 
volution @, defined by F,®F,=F if there are independent variables having 
distributions F, and F,, whose product has a distribution F. The operations * 
and @ turn F into a quasi-ring (not a ring, as * does not have an inverse). Does 
there exists a mapping go: FR satisfying (i) 9(F,+*F,)=o(F,)+o(F,), (i) 
0(F, @F,)=0(F,) o(F,), (iii) ep(F)=E(F) whenever the expectation is finite? We 
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can show that the latter two conditions can be satisfied if (i) is dropped. 

4) As the variance (or any other cumulant) is also a partial homomorphism 
and is completely determined by the behaviour of the characteristic function at 
the neighbourhood of 0, it can also be extended for every variable. However, if 
we seek this extension of variance in the form 


4(6)=9((E —(6))’) (4) 


where @ is an extended expectation, we generally fail. Can one construct @ so 
that the mapping 6 defined by (4) will be also a homomorphism? This is 
connected with problems 2) and 3). 
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1. Preliminaries 


Let {n,}? be a sequence of integers such that 
n,,/h2q>1 for each k. 


For a real sequence {a,} and @ in [0, 1] we consider the series 


C,(0)= > a, cos(22n, 9), 
k=1 


S,(0) ¥. a, sin(22n, 6), 
k=1 


Z,(0)= 3 a, exp(27in, 4). (3) 


k=1 
By Sidon’s theorem a necessary and sufficient condition for the everywhere 
convergence of these series to a bounded function is that Ta converges 
absolutely. A necessary and sufficient condition for the iain everywhere 
convergence is that ya? converges. The fact that {exp(2zin,@)} behaves 


1 
somewhat like a sequence of independent identically distributed random vari- 
ables (see [6] and [7]) has led to a number of probabilistic results on the 


@ 
asymptotic behaviour of these series when )'a; diverges. The results that have 


1 
been obtained (e.g. the central limit theorem [10] and the law of the iterated 
logarithm [11] and [12]) have usually been of more interest to probabilists than 
to complex analysts. 
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The explanation for this probabilistic behaviour is to be found in the 
following invariance principle for lacunary series obtained by Philipp and Stout 
(see [9], p. 60). Let 

By=A=3 Ya 
k<N 


and if we[0, 1] define 


S(t,@)= ¥ a, cos(2mn,@) when te[ Aj, Aj, 1). 
kN 


Proposition 1. Suppose that Ayo as N- and that there exists d€(0,1] such 
that a,=o(1) A; ~* as k-00. Then without changing the distributions of S we can 
redefine the process {S(t),t20} on a richer probability space together with a 
standard brownian motion X(t) such that 


|X (t)—S(t)|=o0(1)t*?-* as t>00 (4) 
for each 1<6/32. 


In [9] it is shown how the central limit theorem and the law of the iterated 
logarithm are immediate consequences of this result. 

Our present work was motivated by a desire to find conditions on {a;} which 
are necessary and sufficient for the almost everywhere divergence to infinity of 
the series (1), (2) and (3). In [8], p.157, Kahane showed that if X is a one 
dimensional brownian motion and {t,} is an increasing sequence satisfying 
teai—tact,—t,_1 for n=1,..., 


then }'t;*<oo implies that |X(t,)|>00 almost surely, whilst if }'t7 += 00 then 
the sequence {X(t,)} is almost surely dense on the line. Similar behaviour for the 
complex brownian motion is governed by the condition )'t; '< 00. Were it not for 
the error term in (4), the invariance principle, with t,=2B, and {a;} monotone, 
could be applied to yield 


);By*<oo implies |C,(0)|>00 for almost all @ whilst )°B,*=0o implies that 
for almost all @ the sequence {C,(0)} is dense on the line. 


Similar results could be deduced for S, and Z,. Unfortunately, even if the 
condition 2 < 6/32 could be replaced by 4<6, the error term is too large to allow 
us to obtain a result of this type. However, these observations do point the way 
to the truth of the matter. 

In this paper we obtain a direct approximation to (n,x)=n,x—[n,x], by 
independent random variables each uniformly distributed over [0,1]. Our 
results are applied in Sect.3 to give information regarding the almost every- 
where behaviour of lacunary trigonometric and other lacunary sums. In Sect. 4 
we change from considering the almost sure asymptotic behaviour of sums to 
looking at the @ sets where series have exceptional behaviour. In these cases we 
consider the Hausdorff dimensions of the exceptional sets. We also give some 
applications to lacunary power series. 
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2. Approximation to (n, x) 


Let [x] denote the integer part of x, and (x)=x—[x]. Let {n,}f be a sequence of 
integers. We say that {n,} is strongly lacunary if n,,,/n,>1 for all k and 


¥n,/n,,., is finite. Thus, for example, k! is not strongly lacunary but (k!)? is 
1 


strongly lacunary. Strongly lacunary series were used, implicitly, in [4]. 


Theorem 1. Let {n,} be a strongly lacunary sequence of integers. If 0<6<1, 
there exist: 

a) a sequence of independent random variables {0,}, each uniformly distributed 
over [0,1] and defined on some probability space Q; 

b) a subset A, of [0,1] of measure less than 6; 
and 

c) an invertible map T:Q—[0, 1]\.A; such that Tand T~' take sets of positive 
measure into sets of positive measure and such that for x¢€ A, 


\(n, x) —0,(T~ * x)| $3n,/n,, 5 
provided k= p=p(6). 
Proof. Let m,=[n,., ,/n,]—1, and I(k) be the family of intervals [j/n,, (j+ 1)/n,), 
j=0,1,...,n,—1. Then each interval of I(k) contains m, (or, (possibly, m,+ 1) 


intervals of [(k+1). We fix a large integer p and define, inductively, a family 
J(k), k= p of intervals. Let 


J(p)=1(p). 


If J(k) has been defined we let J(k+1) consist of all those intervals of I(k+1) 
which are strictly contained in some interval of J(k). Thus J(k+1) consists of 
clusters of m, (or m,+1) intervals of length nz’,. For each cluster which has m, 
+1 members we remove the interval furthest to the right. The resulting family is 
called J(k +1). 

We define K(p) to be the intersection over k=p of the union of all the 
intervals in J(k). Then K(p) is a generalized Cantor set and the Lebesgue 
measure, 1(K(p)), of K(p) satisfies 


ALK (p)]21 ee ny/My 5 4 
so that “2 
A[K(p)] 1 as poo. 
If xe K(p) we define 
a,(x)=[n,x] 


and if k>p we let «,(x) be the number of consecutive intervals of I(k + 1) wholly 
to the left of x, which are contained in the same interval of J(k) as x. Then 


a, (x)e{0, 1, ...,m,—1} 
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|, (2x)/m, — (nm, X)| S2/m,. (5) 
Now define N,=n, and, for k>p, N,=m,N,_,. Let Y=[0,1] be a second 
copy of the unit interval and define 


BAy)=LN,¥] 


and, for k>p, 
B.(y)=LN,y] mod m,. 
The map 
V:K(p) > Y 
defined by 
Vx=y 
if and only if 


a(x)=By) for k2p, (6) 


is such that V and its inverse take sets of positive measure into sets of positive 
measure. Under Lebesgue measure on Y the family {6,} k2p are independent. It 


will now be shown that B,/m, can be approximated by independent random 
variables, uniformly distributed over [0, 1]. 


Let {,} be independent identically distributed random variables, each 
uniformly distributed over [0, 1]. Let 


9, =(N, ,) (7) 
so that {0,}° are also independent and uniformly distributed over [0, 1]. Let 


7x =LN,%,] mod m,. 
Then 


|y,m ,—9,| <1/m, (8) 


and {y,}> have the same joint distributions as {f,}°. Thus if Q is a probability 
space supporting {¢,}% there exists a measure theoretic isomorphism between Y 
and Q. The result now follows from statements (5) to (8). 


3. Oscillation of Series 


Suppose that {X;}? are independent identically distributed R* valued random 
variables. For a fixed sequence {a,} we define the weighted partial sums 
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We shall assume that the sequence (a,} satisfies the following conditions: 


a;20 for alli; 


B,-> © and max a;=o(1)C, as noo 


1Si<n 


B,=).a? and C,=)4,. 
i 1 


We say that the sequence is d-smooth if 


m=1 n=1 


N m N 2 
> & B,Bn»—By) = O(1) (x B, **) as Noo. (12) 
n=1 


Note that if {a;} is monotone increasing or if a;=j*,a>— then a ; is d-smooth 
for d=1 or d=2. 

The following proposition can be proved by combining the asymptotic 
estimates for the distribution of T, (obtained by Bretagnolle and Dacunha- 
Castelle in [2]) with Kahane’s argument on p. 122-124 of [8]. 


Proposition 2. Suppose that a; is as above, and that {X,;} have finite variance and 
mean zero. If 


Y Br *#<0a 


n=1 


|T,,| +00 with probability one. 


If d=1 or d=2, a, is d-smooth, and 


ie.8) 
>, B, *4=00 
n= 1 
then, with probability one, 


{T,} is dense in R‘. 


Since we can approximate (n,0) by independent random variables, Pro- 
position 2 allows us to deduce results on the asymptotic behaviour of lacunary 
series. For the rest of this section we shall suppose that {n,} satisfies the 
conditions 


a 


dimly 1 <2 (13) 


Dagny 1 <0. (14) 
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Theorem 2. Suppose that {a,} satisfies the conditions (9) to (11) and (12) for d=1, 


and that {n,} satisfies (13) and (14). Let S,(0) and C,(0) be as defined in (2) and (1). 
Then if 


Y Brt<a 
1 


|S,,(0)| + 00 and |C,,(0)|-+00 as n> 00 


for almost all 0. If 


YB, t= 0 


1 
then, for almost all 0, the sequences {S,(0)} and {C,(0)} are dense on the line. 


Proof. Consider the sequence {S,(0)}. Suppose that 0<6<1 and that A;, T, Q 
and {6;} are as in Theorem 1. Let 


X (@)=sin(270;(@)) on Q 
and 
Y,(x)=sin(2720,(T~' x)) on [0, 1]\A;. 


The properties of T ensure that 


and 


have the same almost sure asymptotic behaviour on their respective domains. 
Now 


|sin(2n, x)—sin(270,(T ~' x))| S$2a\(n, x)—0,(T~* x) 
S2nn,/n,, 


provided k is sufficiently large and that x¢A;. Thus, by (14), if x€ A; 
i Yi(x) 


converges to a finite limit as n->oo. Thus S,(x) has the same almost sure 

asymptotic behaviour as )\ a; Y,(x), and so has the same almost sure asymptotic 
n 1 

behaviour as )'a; X;. The result, for S,(0), now follows from the facts that 6 is 


1 
arbitrary and that we can apply Proposition 2 to the random sum. A similar 
argument gives the result for C,(6). 
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Note that the same argument applies if we replace sin x and cosx by any non 
trivial periodic function which satisfies a Lipschitz condition of order one. The 


argument can also be used to give the following result on the behaviour of 
complex sums. 


Theorem 3. Suppose that {a,} satisfies the conditions (9) to (11) and (12) for d=2, 
and that {n,} satisfies (13) and (14). Let Z,(0) be as defined in (3). Then if 

> B, ' <0 

1 


|Z,(9)| ~00 }=0as n> 
for almost all 0. If 


then, for almost all 0, the sequence {Z,(0)} is dense in the plane. 


It would be interesting to know to what extent these results are valid for 
general lacunary series. Presumably the condition on {n,} can be relaxed 
somewhat but the fact that the finiteness of } B>' may change if we reorder the 
coefficients shows that some restriction on {a,} is necessary. It seems difficult to 
get very far by using complex variable methods. The problem lies in the fact that 
we have no asymptotic estimates for 


P{Z,€D(a,b)} 


(D(a,b) being the disc centre a with radius b). The central limit theorem only 
gives estimates for 


P{Z,,B-'eD(a,b)}. 


Even the following problem does not seem to have been solved. 
Let 


Q,(0)= yr exp(27 in, 0). 
1 


Is it true that whenever {n,} is lacunary {Q,(0)} is, for almost all 0, dense in the 
plane? When n,=(k!)? the answer is yes (by Theorem 3). Complex variable 
methods can be used to show that there exists r>0 such that for almost all 0 


lim sup D(Q,,(0),r) = R?. 


noo 


4. Asymptotic Behaviour 


In this section we consider the exceptional behaviour of the series 


N 
Zy(0)= > exp(2zin, 9). 
1 
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The results we obtain are special cases of those in the next section but the 
simpler proof serves to motivate later arguments. 


The Hausdorff measure * is defined by u*= lim y§ where 
60+ 


u3(A) = inf Y'|S,\"* 


and the infimum is taken over all countable covers of A by sets, S;, of diameter 
|S;| less than 6. The Hausdorff dimension of a set B is then given by 


dim B= inf {a: u*(B)=0}. 
Suppose that A€(0, 1) and define 


5(4)= () {0: cos(2an,0)=A for all k>N}. 
N=1 


Standard arguments show that, if inf n, , ,/n,>3, S(A) has positive dimension for 
some A, and that, if n,, ,/n, 00, S(A) has dimension one for each 2. 
It follows from (15) that for all 0 


IZy(OSN 


and the law of the iterated logarithm for exponential sums (see [11]) ensures 
that for almost all 6 


Z,(0)=O(1)(N loglogN)* as Noo. 
On the other hand Zygmund, in [12], has shown that 
lim sup |Z y(9)|/N >0 
N00 
on a set that is dense in [0,1]. In [5] Erdés and Taylor showed that this 
exceptional behaviour takes place on a set of dimension one. The fact that 
lim inf|Z,(@)|/N2A_— on S(A) 
No 
shows that an even greater growth rate is possible when infn, , ,/n,>3. 


We now consider the special case n,=2". By looking at the points k2~?, 
k=1, 2, ...,2?, p=1,2, we see that 


Zy(0)=N + O(1) 
on a countable dense set. More generally we have: 
Theorem 4. Suppose that Z,(0) is given by (15) and that n,=2". Then 
{0: Z,(0)~cN as N 00, for some c+0} 


has Hausdorff dimension one. 
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Proof. Suppose that 0<p<1 and that q=1—p. Let {X;}f be a sequence of 
independent random variables each with distribution given by 


P(X,;=0)=q and P(X,;=1)=p. 
Let 


and let m be the measure on [0,1] defined by 


m(A)= P(X €A). 
Since 


1 : 3 
q § sin(2 x) dm(x)=(q—p) | sin(2x x) dm(x) 
0 0 


1 
f exp(27ix)dm(x)=0 
0 


if and only if p=q. 
Let 


Bax ¥ DEXYN — p as N20}, 
1 


Billingsley (see [1] p. 141 onwards) shows that if A is such that m(BOA)>0 then 
dim(Bo A)=e(p)= —(p log p + q log q). 


Next consider the transformation Tx =2x mod 1. The following facts can easily 
be checked: 


i) m is supported by B (use the strong law of large numbers); 
ii) TB<B; and 
iii) T preserves m and is ergodic with respect to m (as is shown by a 


modification of the argument on p. 12 of [1]). Thus the ergodic theorem ensures 
that 


N 1 
Yexp(22iT* x)/N > | exp(27ix)dm(x) 
1 0 
for m almost all x. Thus if p+q 
N 
¥ exp(22i 2* x)/N +c+0 
1 


on a set of dimension at least e(p). As sup {e(p)} =1 the result follows. 


5. Asymptotic Behaviour (General Case) 


Now let {a,} be given a sequence with a,20 and set 


N 
Zy(0)=)¥. a, exp(27i 2* 8). 
1 
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We define Cy by 


N 
i 
Theorem 5. Suppose that Cy 00 as N +0. Then if, as N +0 
ay=O(1) Ci-* for some 5>0, 


the set 
{0: Z,(0)~2 Cy as N +0 for some 4+0} 


has Hausdorff dimension one. 


Proof. Let X be the random variable constructed in Theorem 4 and let m,(X,dx) 
be the random measure obtained by putting atoms of mass a,/C, at the points 
2)X mod1, j=1,2,...,n. First we show that 


P{m, converges weakly to m}=1. (16) 
It is enough to check that for each dyadic cylinder ]=[k2~?,(k+1)2~”) we have 
m,(I)->m(I) almost surely. 


Then the fact that there are only a countable number of such cylinders means 
that this convergence takes place for all J with probability one, which implies 
(16). 

Define ¢,(x)=[2"x]mod2. Then any dyadic cylinder I can be specified by a 
sequence {/,}7, 2,=0 or 1, by means of 


I={x: ¢,(x)=A,; k=1,2,..., p}. 
Define the random variable Y, to be the indicator of the event 
Xi4,=4, k=1,2,...,p. 


Then EY,=m(I) and Y, and Y; are independent if |i—j|>p. 
If we apply Theorem 1.1.15 of Theodorescu [5] to the random variables 
f, =a; Y, we see that 


m= ¥ a,¥/C, 
i=1 
—m(I) 


almost surely, provided that (a,/C,)? converges. However for some K and 
some 6>0 we have a;< KC} ~° for all j. Thus 


Y(a/C)? <K Ya; c;* +6) 


which is finite by Dini’s theorem. 
Thus, for m almost all X, m,(X,dy) > m(dy) weakly. Hence 


1 1 
! exp(22ix)m,(X,dx) > f exp(272i x) dm(x) 
0 
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for m almost all X. If p+q the latter limit is non zero and the conclusion follows 
in the same way as it did in Theorem 4. 


6. Lacunary Power Series 


We now obtain some results for a class of lacunary power series. First we let 


S(r, 0)=) a, r™ exp(27 in, 0) 
1 


M(r)=sup S(r, 0)=)\ a, r". 
8 1 


Lemma. Suppose that a, =0, that 


Qn41~4, asn-o 


and that 


max a;=o(1)C, as no. 


1<i<n 
Let N(r)=max{k: r>4}. Then, for all 0, 
IZ nr(9)— S(r, A)|=0(1) Cy, as r>i—. 
Proof. First choose an increasing sequence {p,} of integers such that 


P,70 and a,p,=o(1)C, as n>, 
and 


284, ,,/a,S2 provided |i| <p,,. 
We define p(r) by 
P(T)=Pneys 
and, where there is no confusion, we write p for p(r) and N for N(r). Now 
1—r sn{1—r). 
The fact that r“”>4 shows that 


r>exp(—log2/N(r)) 
and so 
1—rsSlog2/N(r). 
Thus 
1 —r S(n,/nyp) log2. 


Now we express S(r, 0) in the form 


S(r, 0)=0,(r, 0)+0,(r, )+0;,(r, 0) 





J. Hawkes 


o,(r,0)= } a,r"exp(2zin,9), 
N-p 


js 


o,(r,0)= YY a,r%exp(2xin,6), 
li-Nl<p 


o,(r,0)= YY a,r"exp(2zin,9). 
j2N+p 


Comparison with a geometric series shows that 
|o3(r, 0)|=O(l)ay,, 

=O(1) ay 
=0(1) Cy): 


\o,(r,0)|S2p(r) max a, 
li-Ni<p 


<4p(r) ani) 


lo(r, 0) ‘on Zy_ pl 4 ; 


Finally 


N 
IZy(0)—Zy_ AIS » a; 


P 
S2(p(r)+ 1) An(r) 
=0(1)Cyy as rai-. (20) 


The result follows by combining (17), (18), (19) and (20). 
Let B(r)=()\azr?")*. Then, according to Weiss [12], for almost all 4 we 


have 
lim sup |S(r, 0)| [B(r)? log log B(r)]* = 1. 
putas 


We can now examine the exceptional set for a special class of lacunary series. 
Theorem 6. Suppose that n,=2*, that a,2=0, and that a,~a,., as k +00. Then if, 
for some 6>0, 
a,=O(1)Ci-* as ka 
the set 
{0: S(r,0)~AM(r) as r-+1— for some 4+0} 


has Hausdorff dimension one. 
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Proof. The preceding lemma with 0=0 shows that 
M(r)~Cyy, asral—. 
The result then follows from Theorem 5. 


This result admits interpretations in the language of cluster sets. For 


example let f ()=S2?. The unit circle is then the natural boundary of f and it 
0 


can be shown that almost all points of the unit circle are Plessner points (see 
[3], p. 147). Our result shows that the set of Fatou points (with limit value oo) 
has Hausdorff dimension one. 


Remarks. 1. I would like to thank the referee for his valuable comments and 
suggestions for improvement. 

2. The proofs of the results in Sect. 3-6 remain valid if we assume only that 
n,= [| r, where the r; are integers which are at least two. 

jsk 

3. We have chosen in this paper to emphasize those problems where 
probabilistic techniques yield better results than are available through the 
techniques of complex analysis. Presumably Theorems 3, 5 and 6 hold in much 
greater generality but at present they seem beyond the range of both pro- 
babilistic and complex analytical methods. 
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Scaling Transformations for {0, 1}-valued Sequences 


G.L. O’Brien 


Department of Mathematics, York University, Downsview Ontario, Canada M2N 3T5 


Summary. Let r=2 be an integer and let g: {0,1}’- {0,1} be a function. Let 
T be the transformation on Q={0,1}7 given by (To)(i)= ((ri), (ri 
+1),...,@(ri+r—1)) for all ieZ. For P in the class of strongly-mixing shift- 
invariant measures on Q, we investigate when P is invariant with respect to 
T and when T"P converges. For example if r is odd and 9(@p,...,@,_,)=1 
iff }\;>4r, the invariant measures are the Bernoulli measures with means 0, 
+ or 1 and T"P must converge to one of these three measures. Other choices 
of g can give more complicated behaviour. 


1. Introduction 


Let Z denote the set of integers and let Q= {0, 1}7. Let § denote the o-algebra of 
subsets of Q which is generated by the cylinder sets. Let $: 2-Q be the shift 
transformation on Q; that is (Sw)(i)=@(i+1). Throughout the following, mea- 
sures on 2 will be taken to be shift-invariant probability measures on the 
measurable space (Q, %). We will consider w(i) to denote the 7th component of 
an element weQ or, when convenient, to denote the i’th term of a strictly 
stationary sequence {q(i)} of {0,1}-valued random variables defined on some 
other probability space. In the latter case, one measure on (2,%%) is the 
distribution of {q(i)}. The reader should have no difficulty distinguishing 
between these two interpretations. 

Let ~: {0,1}’->{0,1} be a function, where r is an integer greater than one. 
The scaling transformation corresponding to @ is the transformation T: 2-Q 
defined by 


(Tw)(i)= g(@(ri), o(rit+ 1), ...,@(ri+r—1)). (1) 


We call g the scaling function for T and r the scale of T. Let P be a measure on 
Q and let P, be the measure on 2 given by 


F,(A)= T" P(A)= P(T~"(A)). (2) 


Research supported in part by the Natural Sciences and Engineering Research Council of 
Canada 
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The purpose of this paper is to determine which measures are invariant under T 
(i.e., to solve the equation P, = P) and to determine for which P the sequence {P} 
converges. The main results of the paper are given in Sect. 3, where we show that 
for P in a large class, and for certain scaling transformations T; P is T-invariant 
iff P is one of a finite set of Bernoulli measures. The measures in the set are 
found by solving a polynomial equation obtained from g. Furthermore, for such 
T, P, converges for all P in the indicated class. 


Of particular interest are the majority transformations obtained by taking r 
odd and using the scaling function 


r—1 
j 1 
1 if )@,>4r 
Fy 
P(@o, 1, ---,@,_1)= ' 
ae 


0 if }a,<tr 
i=0 


for all (ao, ...,@,_ ,)€{0, 1}". 

The question of identifying the invariant measures for majority transfor- 
mations was posed by Professor F. Spitzer, who conjectured that the only non- 
trivial invariant measure is the Bernoulli measure with probability of each 
outcome being one half. In Sect. 4, we show that these transformations satisfy 
the requirements of our theorems and that Spitzer was essentially correct. In 
Sect. 6, we present several other invariant measures which are not in the large 
class to which our theorems apply. 

Majority and other scaling transformations can be used to model a number 
of physical phenomena. Spitzer, motivated by the work of Mandelbrot (1977) on 
scaling, considered a person in an airplane observing the earth’s surface below as 
being land or water. Incomplete resolution causes him only to detect the effect 
of successive intervals of length r. He records a one or zero (land or water) for 
each interval in a way that depends on the pattern of zeros and ones within the 
interval. The most natural situation would have him record the value that 
predominates (i.e. holds the majority) within the interval. 

A similar example is obtained by considering a forester who walks through 
his domain and classifies each section as disease-ridden if the fraction of 
diseased trees he examines in the section exceeds some specified value. 

A third example is obtained by considering a voting system in which the 
electorate is divided into cells of size r. Decisions are made by first voting within 
each cell and then having each cell (or its representative) vote in accordance 
with the majority within the cell. This could be extended to several stages by 
having cells, subcells, etc. (In practice, of course, representatives do not always 
heed the majority of their “cells”.) 

Finally, we note that Wilson (1979) has used a two-dimensional majority 
transformation to study the Ising model of magnetization. 

Another interesting class of scaling transformations are the sum modulo two 
transformations obtained by defining 


r-1 
(Wo, @4,---,@,_1)= ¥@; (modulo two). (4) 
i=0 
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If we denote T by T,, we observe a semi-group structure for these transfor- 
mations: 


T,-T,= T,s- 


(This equation does not hold in the case of majority transformations.) The sum 
modulo two transformations are analoguous to the transformations considered 
by Sinai (1976) and more recently by Taqqu (1979) and others, in connection 
with self-similar probability measures on (— 00,00)”. It is easily seen that the 
hypotheses of the theorems of Sect.3 do not hold for sum modulo two 
transformations. The conclusions of the theorems hold for P in a more restricted 
class but not for all P in the class considered in Sect. 3. These results are given in 
Sect. 7. 

It should be noted that our methods and conclusions are quite different from 
those of “block transformations” given by 


(U w)(i)= p(w(i—3(r— 1), o(i-2(r — 3), ..., o§ + (7 — I) 


for odd r. If @ is the majority function, the invariant measures include all 
measures which are concentrated on sequences all of whose runs have length 
greater than 4r, since such sequences are fixed points of U. Block transfor- 
mations have been studied by Coven, Hedlund and Rhodes (1979) and by 
Miyamoto (1979), among others. 

It is obvious that scaling transformations are not invertible. Dekking (1979) 
investigates transformations which are right inverses of scaling transformations. 
To be precise, his transformations substitute specified words for each symbol. 


2. Classes of Measures 


In this section, we give descriptions of some classes of measures on (Q, %) which 


will play a role in the study of scaling transformations. Recall that measures are 
assumed to be shift-invariant and to have total mass 1. 


The most important measures are the Bernoulli measures. If P is Bernoulli 
and P(w(0)=1)=p, we write P=B,. Of course, B, is the distribution of an iid. 
sequence {a(i)} of {0,1}-valued random variables with P(a(i)=1)=p. 

Let %(j, k) denote the o-algebra of sets in § generated by sets of the form 


{weEQ: w(i)=@,;,j Sisk}. 


We consider the class .@ of strongly mixing measures on Q; Pe.@ if there exists 
a function g on the positive integers such that g(k)|0 as k-0o and 


| P(AB)— P(A) P(B)| Sg(k) (5) 


for all AE%(— 00,0) and Be ¥(k, 00). The Bernoulli measures are the measures in 
M for which (5) holds with g(1)=0. A second useful subclass of .@ is the class 


M, of 2-dependent measures; that is, the set of Pe.@ for which (5) holds with 
g(2)=0. 
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We will say P is a Markov-based measure on Q if P is the distribution of a 
sequence {f(X,)} where X,, is a stationary aperiodic positive recurrent irreduc- 
ible Markov chain with a countable state space I and f is a function from J into 
{0, 1}. It is well known that every Markov-based measure is in .@. A special case 
is obtained by assuming J = {0,1} and P is the distribution of {X,,} itself. 

Various classes of measures on (Q, %) are invariant under scaling transfor- 
mations. Included are the classes of Bernoulli measures, shift-invariant measures, 
strongly mixing measures, 2-dependent measures and Markov-based measures. 
In the last case, the state space of the underlying Markov chain may have to be 
enlarged if it is finite. The property of being the distribution of an irreducible 
aperiodic stationary Markov chain on {0,1} is not generally preserved. For 
example, if T is the majority transformation with scale 3 and P is such a 
distribution and has a symmetric transition matrix, then TP is such a distribu- 
tion if and only if P=B,. 


3. Invariant Measures and Limits 


Lemma 1. Let Pe.@ and let T be a scaling transformation. Every subsequence 
{P,} of {P} has at least one accumulation point (i.e., probability measure to which 


some subsequence of {P,} converges weakly). If Q is an accumulation point of 
{P,}, then QE.M,. 


Proof. First note that & is the Borel o-algebra with respect to the compact 
product topology on 2. By Prohorov’s theorem, every subsequence of {P} has a 
convergent subsequence (cf. Billingsley (1968), p. 19). We note in passing that 
P,, > Q (weak convergence) iff P,(A)>Q(A) for every cylinder set A (Billingsley, 
p. 37). We make frequent use of these facts without explicit mention. 

Let Q be an accumulation point of {P}. Since each P. is shift-invariant, so is 
Q. Let AEG(— 0,0), BEG(2, 0). Then T~"AE¥(— 0, r"—1), T~"BEF(2r", «). 
Therefore 


|Q(AB)— Q(A)Q(B)| Slim sup | F,(AB) — F(A) F,(B)| 


=lim sup |P((T~-"A)(T~"B))— P(T-" A) P(T -"B)| 


< lim g(r”)=0. 
Thus Qe./,. This proves Lemma 1. 
It would be nice if we could improve Lemmal by showing that each 
accumulation point Q of {P} must in fact be a Bernoulli measure. It will be seen 
in Sect. 7 that this is not always the case. The next result shows that it is the case 


for some scaling transformations. For each scaling function @ with scale r, we 
define two sets: 


C=C, ={weg: p(0, w(1), o(2), ..., a(r—1)) 
+ ~(1, w(1), w(2), ..., a(r—1))}, 
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C=C), ={meQd: p(w(0), w(1), ..., o(r — 2), 0) 
+ ~(w(0), (1), ...,2(r—2), 1)} (7) 


Lemma 2. Let Pe. and let Q be an accumulation point of the sequence {P}. If 


lim inf P(C A C’)<1, (8) 


then Q is a Bernoulli measure. This holds in particular if 
Q(COC)<1. (9) 
Proof. We deduce from (8) that either 


a =lim inf P(C)<1 (10) 


n> © 


or the same holds with C replaced by C’. We assume (10), the other case being 
similar. Let AeE¥(— 00, —1) and Be (0,0). It suffices to show 


Q(AB)=Q(A) Q(B). (11) 
Let e>0. Let {P,,} be a subsequence of {P} such that 
P,(C)<}(1+a) (12) 


for all k and such that 
m,,,—m,22 (13) 


for all k. Let | be sufficiently large that 

U.+a)]'<e. (14) 
Let {P,} be a subsequence of {P} which converges to Q and assume 

n,>m, (15) 

for all k. For k sufficiently large, 

Ig(r™-'")\<e. (16) 
Let A’=T™-'~"™(A) and B’=T™-'~"(B). For weEQ, define w*EQ by 

or={P 0 = er 


Then let B*={weB’:w*teB}CB’.. If weB’—B*, then T™%~™-'weB but 
T™-™-'@*¢B. This in turn implies that weC, TweC,...,T™~"-'~'weC. 
Since Be (0,0), we use (12)-(16) to obtain 
B. (F-F9GE (Cat: * Ca... ATO") 
ENE AGF iets Mo Po lpamitiony, *) 
Sh AOR... (C)-. £I(O+lee 
<2e. 
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Observe that B*e% (1,00) and A’e%(— 00, —1). Thus 

| F,, (AB) — F,, (A) P,, (B)| =| B,, 

=) ae 


+|P 


mie-1 


»- (AB) — Fry, (A) Pn (B)| 
(4’B’)— P(A’ B*)| 
(A’B*)— PF, _ (A) Fn, (B*)| 
+F,,_ (40 1E.,_,B*)—£,,.,B I 
S2F,,,_,(B’— B*)+g(""-') 


<5e. 


Since ¢ is arbitrary, (11) follows and Q is Bernoulli. It is obvious that (9) implies 
(8) so the proof of Lemma 2 is complete. 

The requirement that (8) holds is very inconvenient. For some scaling 
transformations, including the majority transformations, it holds for all Pe.@. 


By contrast, for any scaling transformation whose scaling function g has the 
form 


P(Wo, My, ---,@,_ 1) =Dyp + W(@ 4, 5, ...,O,_ 2) +.@,_1, (17) 


where addition is performed modulo 2, the condition fails for all Pe.@. The 
main examples are the sum modulo two transformations. Some partial results 
for such transformations are given in Sect. 7. 

Associated with each scaling function @ is a function x=7,: [0,1]—[0, 1] 
defined as follows: 


m(p)= >) a;p'(1—py (18) 
i=0 


where a; is the number of r-tuples (@ 9, @,,...,@,_ ,) in {0, 1}” which are in g~1(1) 
and which have exactly i ones and r—i zeros. The fact that O<2(p)<1 for all p 


follows from the fact that 
ae40, | ERO (1) 


for each i, so that in particular 


gS: ae , 
nip) ¥ (;) pid—py-'=1. 
i=0O 
The importance of the function z is derived from the following fact: 


TB,=Byy- (19) 


It follows that the Bernoulli measures which are invariant under T are the 
Bernoulli measures B, such that p is a fixed point of z, i.e., such that m(p)=p. (At 
least one fixed point exists.) Let x! =z and, for n=2,3,..., let x"(p)=x(x"~1(p)). 
If P=B,, then P=B,,,,). Thus, any limit theorems for {P} inevitably involve the 
limiting behaviour of the sequence {x"(p)}. 
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Theorem 1. Let T be a scaling transformation and let Pe.@. Assume 
P(COC)<!1 (20) 


for some n. For k=1,2,...,R=P iff P=B, where n*(p)=p. In particular, P is 
invariant under T iff P=B, where p is a fixed point of n. 


Proof. Suppose R. =P. Then the sequence {P} is periodic so that (20) implies (8). 
Also, the subsequence {P,} converges to P as noo. By Lemma 2, P=B, for 
some p. The results now follow from (19). 


Theorem 2. Let Pe.M@ and let T be a scaling transformation. Assume P satisfies 
(8). Also assume x is not the identity function (n(p)#p) and 


n*(p)=p = n(p)=p (21) 


for all pe[0,1]. Then P, converges to a Bernoulli measure B, such that n(p)=p. 


Proof. By Lemma 2, there is a subsequence {P,} of {PR} such that PR, > B, for 
some pe[0, 1]. It follows easily that P, ,, => B,.,,) for /=1,2,.... Now Sharovsky 
(1966) (see also Kloeden, Deakin and Tirkel (1976)) showed that (21) implies 
m'(p) must converge to a fixed point py of a as Ico. Since the set of 
accumulation points of a sequence is closed, there must be a subsequence {P,,} 
of {PR} which converges to B,,. Let a(n)=P(w(0)= 1). Obviously, a(m,)— po. 

The result is trivial if x is a constant function; if z is a constant it must be 0 
or 1, which means 4g is also constant. Since 2(0) and z(1) can both only equal 0 
or 1, there are only two first degree possibilities for 1, namely x(p)=p and 
m(p)=1-—p, and these are excluded by explicit assumption and (21), respectively. 
We may therefore assume x has degree two or higher. It follows that there is a 
neighbourhood N of py such that for all peN — {po}, 


n(p)+p, (p)+2po—p, (p)+Po and x’ (p)+0. (22) 


We will assume throughout that 0<p,)<1. The other two cases are somewhat 
easier since some of the situations described below cannot occur. We will say a 
property holds “to the right (left) of p.” if there exists an ¢>0 such that the 
property holds for all pe(po,po+e) (respectively, (py—&,po)). The proof that 
P,= B,,, is split into various cases according to how many of the inequalities 
m(p)>p and n(p)>2p,—p hold to the right and left of po. This gives nine cases, 
but two are excluded by differentiability of x at py and two others are excluded 
by symmetry arguments. 

We begin with the case for which x(p)>2p)—p for p to the left of py and 
1(p)<2p)—p for p to the right of py. This case cannot occur without violating 
(21). To see this, observe that (21) is equivalent to the condition that the graphs 
of the function z and its inverse intersect only on the diagonal of [0,1] x [0, 1], 
and then use the continuity of z. 

Next, consider the case for which 


p<nx(p)<2py—p (23) 
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to the left of py and 
2Po—P<X(p)<P (24) 


to the right of po. (This includes the case |1’(py)|<1.) Let e>0. Suppose there is a 
sequence r,— 00 such that 


[Po a(r,)| SE (25) 
but 


IPo—a(r, + Di>e (26) 


for all k. By Lemma 2, the sequence {P,} has a subsequence {P,,} which 
converges to B,, where |p—po|Se. This implies P,, ,, => B,,,, where |2(p)—po|<é 
provided ¢ is sufficiently small. In particular |a(s,,,)—po|<e for k sufficiently 
large, which is a contradiction of (26). Since |p, —a(n)| Se for infinitely many n, it 
follows that |p,—a(n)|><« for only finitely many n. Since ¢ is arbitrary, a(n)—po. 
By Lemma 2, it follows that P,=> B,, in this case. 

We will make frequent use of the subsequence argument used above but will 
henceforth give fewer details. 

The third case we consider is the case for which 2(p)<p to the left of p) and 
m(p)>p to the right of po. If P,=> B,,, there must be infinitely many n such that 
|a(n)—po|Se but |a(n+1)—p,|>e, provided ¢>0 is sufficiently small. We will 
show this leads to a contradiction. There are several subcases of which we single 
out one to give in detail. Assume that for a sequence of e’s decreasing to 0, there 
exist infinitely many n such that |a(n)—p,|Se but a(n+1)<p,)—«. Assume also 
that there exists a point p, <p, for which z(p,)=7(p,). If more than one such p, 
exists, choose the largest value. By the continuity of z, it is easily seen via the 
interpretation of (21) in terms of the graph of x (given earlier) that z(p)>p, for 
P,SpSpo. Let 6>0 be such that x(p)>p,+6 for all pe[p,,po]. Choose ¢ 
sufficiently small that 2¢+d<p )—p,, 2(p)<po—2e for all pe(p, +06, py—2¢) 
and x(p)<p for all pe[p )—2é, po). Some subsequence of {P,}, with n as 
described above, must converge to some B, and evidently pp—¢Sp Ppp while p, 
+6<x(p)<p,—s. It follows that there are infinitely many n such that p, 
+6Sa(n)Sp.—e but such that a(n+1)Sp,+6 or a(n+1)Z2p)—«. Taking a 
subsequence of {P,} for such n’s gives a contradiction as in the previous case. We 
conclude that P,> B,, in this case also. 

We next consider the case for which x(p)<p to the left of p, and to the right 
of po. (This implies 2’(p))=1 and x’(p)<O near but not necessarily at py.) 
Suppose that for infinitely many n |a(n)—po|Se but ja(nt+1)—po|>e. If a(n 
+1)>p)+¢ for infinitely many n, we get a contradiction as in the second case 
while if a(n+1)<p,)—e for infinitely many n, we get a contradiction as in the 
third case. The case for which x(p)>p to the left and right of po is similar. 
Alternatively, it can be converted to the case just considered by switching 0’s 
and 1’s in Q. This leads to the replacement of z(p) by 1—72(1—p), whose graph is 
obtained by rotating that of x by 180° about (5, 3). 

The final case, except for a similar symmetry argument, has (23) to the left of 
Po and n(p)<2p,—p to the right of pp (so that 2’(p))= —1). Let e>0. Suppose 
there exist infinitely many n such that |a(n)—po|<e but |a(n+ 1)—po|>2e. If ¢ is 
small enough, this leads to a contradiction by the subsequence argument used 
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before. Now suppose there are infinitely many n such that |a(n)—po|<e, |a(n+1) 
—Pol>« and |a(n+2)—po|>«. Choose a sequence {r,} of such n’s for which {P,} 
converges, say to B, where |p—po|Se. It is clear that p>p, and then that 
1(p)< Po and n7(p)> po. By (21) and the continuity of z, it is easy to deduce that 
1?(p)<p. By the subsequence arguments used before, this gives a contradiction. 
Therefore |a(n)—po|<2¢ except for finitely many n. It follows in this case also 
that P,=> B,,. This proves Theorem 2. 

The assumption that (21) holds is essential. For suppose x*(p)=p but 
(p) +p. Then take P=B, and observe that P, oscillates with period 2. It follows 
from Theorem 2 that if (21) holds there cannot be any pe[0, 1] such that 2"(p) 
fails to converge as n> 00. This is a special case of results of Sharkovsky (1964, 
1966). See also Stefan (1977). The assumption (21) can be relaxed by weakening 
the conclusion of the theorem, as follows. 


Corollary 1. Let Pe.M and let T be a scaling transformation. Assume (8) holds. 
Further assume n(p)# p, m(p)#=1—p, and for some k 


n*""(p)=p <> 1?"(p)=p (27) 
for all pe[0,1]. Then {P,} is asymptotically periodic with period 2". Specifically, 


Prien > B 


n+l m'(p) 


as n> 00, for 1=0,1,...,2*—1, for some pe [0,1] for which 1" (p)=p. 


Proof. Observe that the transformation T = T*" is again a scaling transformation 


on Q with scaling function @ and with corresponding function #: [0,1]—[0, 1] 
given by #=77". The degree of # is at least that of x and as a result 7 is not the 
identity function. Also # satisfies (21) by (27). Finally, (8) must hold for at least 
one of the sequences {P,,x,);, n=1, 2, ...}. The result holds for that value of | and 
hence for all /. 


Condition (27) requires that there be no pe[0,1] such that the sequence 
{x"(p)} have (minimal) period 2**!. By the results of Sharkovsky (1964, 1966), 
this implies that the only possible periods for such sequences are 1, 2, 4, ..., 2*. If 
there exists a pe [0,1] such that {2"(p)} has period not of the form 2* for some k, 
then there also exists qe[0,1] such that {x"(q)} and hence {T"B,} is not 
periodic or even asymptotically periodic. Examples of this situation can be 


r-1 


constructed by taking r large and taking g(@o, @,,...,@,_,)=1 iff } o; is 
i=0 


close to }r. (For such examples, the function z is the operating characteristic 
curve for a test of the hypothesis p=4 for a binomial distribution with the 
two-sided alternative p +4.) 

If x is non-decreasing (21) holds automatically. If 2 is non-increasing, 1? is 
non-decreasing so that (27) holds with k=1. 

We do not know if the assumption 2(p)#p is essential. One type of scaling 
transformation which gives x(p)=p is obtained by taking ~(@o, @,,..-,@,_ 4) 
=), for some ie {0, 1, ...,r—1}. In this case P,(w(0)=1) is constant as a function 
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of n so that P,=>B, where p=P(w(0)=1). The situation is less clear for other 
scaling transformations for which z(p)=p; similar difficulties arise if x(p)=1-—p. 


4. Majority and Related Transformations 


Let r be a positive integer greater than one and let ke{0,1,...,r}. Let 
gy: {0, 1}’+ {0,1} be given by 
r-1 
1 if } wk 
i=0 


(Wo, M1, ---»O,_ 1) = 0 


otherwise. (28) 


If k is taken to be 4(r+1) for odd r, the corresponding transformation is a 
majority transformation. 


Theorem 3. Let T be a scaling transformation with scaling function given by (28). 
Let Pe .M. Then P, converges to a Bernoulli measure B, where p is a fixed point 
of x (given by (18)). Also, P is invariant iff P=B, for such a p. 


Proof. We show Theorems 1 and 2 hold. First, observe that 2(p)#p. Also z is an 
increasing function of p so (21) holds. It remains to check (8), which we do by 
proving (9). Let Q€.@,. Let C be as in (6) and suppose Q(C)=1. Then 


O({a: w(1)+...+@(r—1)=k-1})=1. (29) 
By shift-invariance 
O({w: w(0)+a@(1)+...+a@(r—2)=k—1})=1. (30) 
The combination of (29) and (30) implies 
O({o: o(0)=a(r—1)})=1. 


Thus Q is concentrated on the set of sequences of period r—1, which is 
impossible. This completes the proof. 

Finding the fixed points of x is often difficult or impossible since it involves 
the solution of a polynomial equation which may be of high degree. In the case 
of majority transformations we do have an explicit solution. 


Theorem 4. Let T be a majority transformation. Then Theorem 3 holds and the 
fixed points of x are 0, 5 and 1. 


Proof. We need only solve the equation x(p)=p for O<p<1. It is useful to 
indicate the dependence of z on r by writing x=7,. We will show that for k>1, 


kK (2i-1\ , ; 
ans)=P+ > (", ) pl-p'@p—N). G1) 
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First, for k=1, we have 2,(p)=3 p?(1—p)+p° which does reduce to (31). Assume 
we have shown the result for k—1. Then 


_ 3848 Ahehy , ' 
TiiP= > ( ) pit —ppetrt-' 


i=k+1 


- > {P*>") 42 pe Cy) ota ppt 


eh Abn Hi . , at ek. : 
a a ( ; ) pict —py?*+t-!42 > ( : ) opp 
i=k+1 l i=k l 

** ee... 

ba ( , ) pq —ppet 

i=k-1 
28-1 Qk-1\ , 
=> () )ea-pe*t tap +20 -p) +I 


i=k 


a as k-1 


k ) pra—pyt+ (°  )oh tp 


2k—1 
=t0)+("; ) pp 2p-0). 


Thus (31) holds for all k=1. The terms on the right side of (31), except p itself, 
are zero for p=0, 5 or 1. For 0<p<}4 they are all negative and for }<p<1 they 
are all positive. The theorem follows from these facts. 

The notion of majority only makes sense for odd values of r. We generalize 
the notion to even r as follows. There are an even number of r-tuples in {0, 1}’ 
which have exactly $r ones and 4r zeros (since they can be paired by switching 
zeros and ones). Select half them and define (wo, @,,...,@,_,)=1 iff either 
¥@;>4r or (wo, ...,@,_) is one of the selected r-tuples (in the case } @;=47). 
By definition the function x=7, is given by 


1,(p)=4 (:,) p?"(1—p)*"+ : (") pi(1—p)-!. 


2 i=4r+1 


Consider the new scaling function @ defined by $(w, ...,@,_,)=1 iff }@,>4$r 
or w,_,=0 and }o,=}r. Then @(@p,...,@,_,)=1 iff the majority of 
Wo, @1,---,@,_> are 1’s. It is therefore clear that 

1,(p) = t,(p)= Tl, _ 1(p). 


To obtain the analogue of Theorem 4 for r even, we need only check that (9) 
holds for all QeE./@,. Let us suppose (0, 1,0,1,...,0,1)=1, the other case being 
similar. Then 


1—Q(C)=QO({w: w(1)=o(3)=...=a(r—1)=1}) 
=[Q({a: w(0)=1})]}*">0 
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unless Q({@: w(0)=1})=0. In the latter case, 
1-—Q(C)=O({o@: w(0)=a(1)=...=a@(r—1)=0})=1. 


Thus, Theorem 4 extends to these “even majority” transformations. 


5. Which Limit is Obtained? 


In cases where Theorem 2 applies, there may be several fixed points for the 
function z. We do not have very good methods for deciding which limit is 
obtained for a given P. 

If P=B,, lim P,=Biim_znp), 80 the problem can be solved by determining 
lim x"(p). This can be done (in principle) by examining each fixed point and 
observing whether it is “attractive” or “repulsive”, as determined by the 
behaviour of z near the fixed point. Suppose for example that T is a majority 
transformation. It is easy to see that 2"(p) converges to 0, 5 or 1 according as 
p<4, p=} or p>4 (that is, 0 and 1 are attractive and 4 is repulsive). 

If P is not Bernoulli, the problem is much more complicated. Take for T the 
majority transformation with scale r=3. A sufficient condition for P,> B, is 
that P be a symmetric measure. A sufficient condition for P, => B, (or P, => B,) is 
that P be stochastically less (respectively, greater) than B, for some p<} (resp. 
>4). See for example Kamae, Krengel and O’Brien (1977). 

One might conjecture that P,=> By, B, or B, according as P(w(0)=1)<3, 
=4 or >4 since this is the case if P is Bernoulli. The following example shows 
this to be false. Let {X,,} be a stationary Markov chain with states 1, 2, 3 and 4 
and transition matrix 


1 
2 
3 
4 


Co CO fF Re 
So OC Fh NR W 


Then P(X, =1)=P(X,=4)=3 while P(X,=2)= P(X, =3)=3 for all n. Let f(1) 
= f(2)=0 and f(3)=f(4)=1. The distribution of { f(X,)} has the property that 
P((i)=1)=4 while P,(w(i)=1)=;5. A slight perturbation of the above matrix 
gives an example for which P(w(0)=1)<4 but P,(w(0)=1)>4. At least one, and 
by symmetry both, of the statements 


P=>B, if P(@(0)=1)<4 
and 
P=>B, if P(w(0)=1)>4 


must be false. 
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6. Measures Outside .# 


It is not the case that the only measures which are invariant with respect to a 
scaling transformation are Bernoulli measures. We demonstrate this by giving 
several examples in the case when T is the majority transformation with scale 3. 

First, let P be the measure which puts an atom of $ at each of two sequences 
of alternating zeros and ones. Then P is S-invariant and T-invariant. Moreover, 
P({w: To=q@})=1. It is clear that S is ergodic with respect to P although the 
tail field is non-trivial. On the other hand, T is not ergodic with respect to P. 
Since 


| P(w@(0)= a(n) = 1) — P(@(0)= 1) P(@(n)=1)|=4 


for all n, P is not strongly mixing. On the other hand, if P puts unit mass on 
one of the two alternating sequences, then P is strongly mixing and T-invariant 
and T is ergodic with respect to P, but P is not S-invariant. A similar example is 
obtained by letting P have an atom of size } on each of the four sequences 
...01100110011.... 

There is also at least one S- and T-invariant measure which has no atoms. 
Let Q assign values in such a way that 


Q(c(2i)=0, w(2i+ 1)=1)=O(w(2i)=1, w(2i+1)=0)=4 


and such that all pairs (w(2i),@(2i+1)) are independent. Let P=4(Q+SQ). 
Then P is S- and T-invariant. Again, S is ergodic with respect to P but T is not 
ergodic, and P is not strongly mixing. 


7. Sum Modulo Two Transformations 


Let T be the transformation obtained by defining @ as in (4). We observed 
earlier that Lemma2 does not apply to T. We now investigate when the 
conclusions of Theorems 1 and 2 hold for T. 

As a first step, we examine the function 2 which we denote here by z,, where 
r is the scale. It is easily seen that 


n,(p)=p+(1—2p)z,_,(p). (32) 


It follows from (32) and the fact that 0<z,_,(p)<1 for pe(0,1) that the fixed 
points of 2, are 0,4 and, for even r,1. It also follows from (32) and induction 
that the only solution of ,(p)=4 is p=4. This implies that (21) holds. 

Let Pe. be such that {P,} converges to a Bernoulli measure B,. This 
assumption eliminates the need for Lemma 2. It therefore follows from Theo- 
rem 2 that p is one of the fixed points of z,, namely 0, 4 or 1. It follows that 


r—i 
P( > x cx()=1(mod 2)) = P,(cx(0)=1)+0, $ or F (33) 


i=0 


This fact can be viewed as a local limit theorem for sums modulo 2. 
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We will next show that for some very simple measures P€.@, (33) fails and 
hence {P,} cannot converge to a Bernoulli limit. Let {X,, ne Z} be a stationary 
Markov chain on a probability space (’, %’, P’) with state space {0, 1, 2,3} and 


with transition matrix A given by 
0) @) @ &) 

(0) | « 0 0 1-« 

(1) | o 0 0 l-a 

(2) | 0 a l-a« 0O 

(3) | 0 a l-a O 
where O<a<1. Let f: {0,1,2,3}— {0,1} be given by f(0)=f(2)=0 and f(1) 
= f(3)=1. Let P be the distribution of {f(X,)}. Then P is Markov-based and, 


since the rows of A? are all given by the stationary measure (a7, «(1 —«), (1 —«)?, 
a(1—«)), Pe@,. 


Let S,= a f(X;) (mod 2). It is easily seen that {(X,, S,), n=0, 1,...} is also a 


positive cocaine Markov chain. Its eight point state space has I , = {(0, 0), (1, 0), 
(2, 1), (3, 1)} and J, = {(0, 1), (1, 1), (2,0), (3,0)} as its closed sential classes. 
Furthermore (X 9, S,) must be one of (0,0), (1, 1), (2,0) or (3, 1) with probability 
a7, a(1—a), (1—a)?, «(1—a) respectively. Thus P’((Xo,S))¢€1,)=% and 
P’((X9, So)€1,)=1—«a. The usual limit theorem for Markov chains now shows 
P'(S,=1)->2a(1—«a). It follows that {P,} can only converge to a Bernoulli 
measure if «=4. Actually, it can be shown directly that P is invariant under T 
for all «€(0, 1). 

We next show that for “most” Markov-based Pe.%, it is the case that {P} 
converges to B,. Suppose the Markov chain {X,} has the property that there 
exist states iy,i,,...,i, and jo=ip,j,,jo,.--.j,=i, such that P’(X,=i,|X,_, 
=i,_,;)>0 and P'(X,=j,|X,_;=j,_1)>0 for h=1,2,...,k and such that the 
number of h’s for which f(i,)+ f (j,) is odd. It is easily seen that this is equivalent 
to the Markov chain {(X,,, S,)} being irreducible and aperiodic, which in turn 
implies P’(S,=1|X )=x)—4 for all x. Then, for example, P,(w(0)=1, w(1)=1) 
=P'(S,n_,=1, S,2n_,=0)—4, so that P, has a Bernoulli limit. 
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Introduction 


Let Q be a domain in R4(d>2) and put Q,={x,ER':(x,,...,Xy_1,XgEQ for 
some (x,,...,X4_,)€R4~'}. m denotes a nonnegative Radon measure on Q,, i.e. 
m(1) is finite for every compact set I of Q,. Let m(dt)=m,(t)dt+m,(dt) denote 
the Lebesgue decomposition. Namely, dt is the one-dimensional Lebesgue 
measure and m, (dt) is singular with respect to dt. We assume the following: 


(1) inf{m,(t): teU}>0 for every bounded open set U with UcQ,. 

Let us define positive Radon measures v,(dx)=v, (dx, ...dx,4), 1Si<d on Q by 
v(dx)=dx,...dx,_,m(dx,) if 1Sijsd—1, 

dx =dx,...dx4_,dX,q otherwise. 


(2) vfdx)=4 


Consider the functions a,;, 1 <i,j<<d on Q having the properties: 


(3) For each i,ja,,=a,, and a,; is of C*°-class on Q, where dy=[(d—1)/2]+1. 
Moreover for any compact set K <Q there is a positive constant y=)(K) such 
that 


d a d 
— z &? vi(dx)S Y Gi €j4ij(X) vij(dx) Sy 5 &} v,(dx) xeK, eR‘. 
iad 1 fad 


i,j= 
Now we shall introduce the bilinear form 2: 


BL2=CF(Q) 
d 
Au,v)= ¥ J 


i,j=12 


(4) Ou. Ov 

ax, (x) dx, (x) a; (x) v; (dx). 

Here C}(Q)=the space of C®-functions with compact supports in 2. As will be 
seen later, 2 is closable on I7(Q; dx) with inner product (, ),. Therefore by using 
M. Fukushima’s results [4] we obtain a diffusion except for a set of zero 
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capacity whose Dirichlet form coincides with the smallest closed extension & of 
2. In this paper we get however a stronger result: 


Theorem. There exists a unique diffusion process on Q, whose resolvent R, satisfies 
the following: Let A>0. (i) R,f is continuous on each compact set in Q for every 
feL?(Q;dx)AL?(Q; dx) with p>d. (ii) R,f is a unique solution of 


R,feDlé] 
E(R,LP)+AUR AL Pa=(f Pa ~PECS(Q), 
for every fel?(Q; dx). 


A probabilistic approach due to N. Ikeda and S. Watanabe [9] leads us to a 
nice diffusion in the case where m(dt)=dt+6,,(dt) and a;;=6,;. On the other 
hand, following the arguments of M. Kanda [11] and H. Kunita [12], who get 
diffusions corresponding to m(dt)=dt, the author [16] constructed a diffusion 
associated with & for my=1 and m,=a sum of 6-measures. A number of 
analytical preparations were made in [16]. Our theorem will be proved along 
the same lines. 

First we construct diffusions on R* under additional restrictions on m and 
a;; In Sect. 1 some function spaces relevant to v are specified and their 
properties are studied. In the first half of Sect. 2 {G,:A>0} is defined by 
solutions of (5). It will be then shown that G, maps C, into C,, densely, where 
Cy [resp. C,,]=the space of continuous functions having compact supports 
[resp. vanishing at infinity]. To this end auxiliary lemmas are needed. The major 
part of their proofs is closely follows arguments about elliptic partial differential 
equations which are due to L. Nirenberg [13] and G. Stampacchia [14]. In the 
last half, H. Kunita’s method [12] is used to construct diffusions on R‘. In Sect. 3 
we prove the theorem by employing M. Fukushima’s result [6]. Section 4 is 
devoted to the study of polar sets. As will be seen, any polar set of our diffusion is 
also polar with respect to the Brownian motion, but the converse is false. 


(5) 


1. Preliminaries 


Throughout this section we suppose that m is defined on R' and satisfies 
(6) Azinf{m,(t):teR'}>0, M=sup{m([i,i+1]):i=0, +1,...}<oo. 
v and v,,'s are defined on R¢ by (2). 


Function Spaces. Let Q be an arbitrary domain in R*. We shall discuss the weak 
differentiability of functions on Q. We use the notation: D;=0/0x; for 1Si<d, 
D‘, = D' ... Di4-) for an d—1 tuple k=(k,,...,k,_,) of nonnegative integers, and 
|k|=k,+...+k,_,. Let weL,(Q;dv) and |k|>1. If there exists a function 
veL,,(Q; dv) such that 


fuD’.pdv=(-1)"[vgdv peCFH(Q), 
Q a 
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then such v is determined uniquely up to sets of v-measure zero and it is denoted 
by D*.u. We call them the weak derivatives (with respect to v) of u. For 1<i<d 
and ueL),.(Q;dx)D;,u stands for the distributional derivative, that is, D,u 


belongs to L’,,.(Q; dx) and satisfies 


JuD,9dx=—|D,ugdx peCZ(Q). 
Q 2 


D*,u, Du etc. are defined in the same way. 
For any | <p< oo we introduce the space #?(Q; v): 


U?(Q; v)= {ue L(Q; dv): D,ue L(Q; dv), 1 Sisd—1, Due L?(Q; dx)}. 


By the same method as used in Sobolev spaces we see that: ue L?(Q; dv) belongs 
to %(Q; v) if and only if for each 1 Si<d there is a version u' such that u=uw' y,,- 
a.e. and u' is absolutely continuous (with respect to the one-dimensional 
Lebesgue measure) on 2,(x) for f;,-a.e. &;,, and its (usual) derivative du'/dx; 
(which exists v,,-a.e.) belongs to I(Q;dv,). Here x=(x,,...,Xg) %; 
=(X 1, ---5X;_1,Xj;445+++>Xq)p Q,(x) is the intersection of Q with the line through x 
parallel to the x; coordinate axis, and ¥,(dX;)=dx, ...dx;_,dx;,,...m(dx,). In 
particular v,(dx,)=dx, ...dx,_,. Since %?(Q; v) is complete with respect to the 


norm ||l Ill, 2, v: 


1 


wes 
(7) lleulll,ov= >. (f(D; ul? dv)!” +(f |Dyul?dx)'? + (f |ulPdv)"”, 
i=1 2 2 2 


we can define the following spaces: 


F”(Q; v)=the closure of {ueC'(Q): Ilulll ,,.2,y< OF} in W?(Q; v), 
F§(Q; v)=the closure of Cy(Q) in W%?(Q; v), 


where C!(Q)=the space of functions of C!-class on Q. In situations where no 
confusion of domains and measures occurs, |||-|ll,,o,y. F7(Q;v) and AP(Q; v) 
are written as |||-|||,, 7? and A?, respectively. 

We note some of the simple properties of these spaces. Let H(a,h) 
= {xeR*:|x;—a|<h, 1<i<d} with a=(a,,...,a,) and h>0. 


Proposition 1. (i) A function ueW%?(R*;v) belongs to FR(R*;v) provided 
D,D,ueL?(R‘; dx), 1<isd—1. (ii) Put H=H(a,h). Every function u of F"(H; v) 
has a version v such that (a) v=uv-a.e., (b) v is absolutely continuous on H ,(x) for 
¥4-a.e. X4, and (c) Djv=D,u ae. (iii) The norm || - || is equivalent with the norm 


p.H,v 
|-|,,.H,v in F?(H; v), where 


d-1 
(8) |u|, a.v= 2, (f [DyulPdv)'/? +(f |Dgul?dx)"/? +(f |ul?dx)"”. 
i=1 H H H 
(iv) If F” is replaced by F¢8 in (iii), the assertion is valid for arbitrary domains. 


Proof. (i) We shall show that u can be approximated in ¥? by functions 
belonging to C(R*). We may assume that Supp[u] is compact without loss of 
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generality. Let J,, e>0 be mollifiers and set 


u,(x)=J,* u(x)= | J,(x—y)u(y)dyeCo(R"). 
Ra 


HGlder’s inequality leads us to 
J lu(x)—u,(x)I? v(dx) 
Ra 


< J v(dx) J J,(y)lu(x—y)— ual? dy 
Ra Ra 


<2°-"{f J(y)dy | m(dx,) J lu’ —y’,xy)—ul’, xg)? dx’ 
Ra R!} 


Xa—Yada 


+f JAy)dy J v(dx)) J |Dgu(x’—y’,0)|?de|lya?-"} 
Ra Ra 


Xd 


<2’-"{sup f m(dx,) J |u(x’—y’,x,)—u(x’,x,)|P dx’ 
ly’|<eR! R@-1 


+e?! m(K,) J |D,u(x)|? dx} 
Ra 
—0 as &{0, 


where x=(x’,x,) and K, denotes the x,-projection of «-neighborhood of 
Supp[u]: K,={teR!: |t—x,|<e, (x’,x,)eSupp[u] for some x,€R',x’eR*~'}. In 
the same way 


d-1 
¥Y |D,u—D,u,; 2(R4;dv)| +0 as £10," 
i=1 


because D,u,=J,*D;u in the weak sense and D;D;,u belongs to L?(R*;dx). 
Obviously ||D,u—D,u,; ?(R*;dx)|| +0 as ¢|0, so that \llu—u, |||, 9. Thus we 
get uc FP(R’; v). 

(ii) We observe the inequality 


1 
(9) f jul?dv<2?-! m([a,—h, auth} § \ul? dx +(2h)?-! f1D,uP ax}. 
A 2hiz H 


for ueC'(H), which is derived from the proof of [1; Lemma 5.7]. For 
ueF"(H; v) we now have a version v satisfying v=u a.e. instead of (a), and 
besides (b), (c). But by (9) this v satifies (a). 


The assertion of (iii) immediately follows from (9). 
(iv) It suffices to note R’=|_) H(a, 1/2), where the union is taken over the set 


of all d-tuples a of integers. Indeed, by (9) 
(10) f lulPdv<2?-! M{f |ulP?dx+ fj |Dyul’dx}, ueCZ(R’). qed. 
Ré R4 Ré 


‘In a normed space X the norm is denoted by ||-; X|| except where simpler notations are 


introduced 
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Imbeddings. Following Gagliardo’s arguments [7] (cf. [1; Theorem 5.4]), we now 
show some imbedding characteristics similar to the well-known Sobolev imbed- 
ding theorem. Let H = H(a,h) be a cube as mentioned above. 


Proposition 2. Let 1< p<d, pSq<dp/(d—p) or p=d, pSq< ©. Then 
(11) lu; (H; dv)|| SC, (d,p,q, A) v(H)"4-*?*** |u| «= ue FGA; v), 


where 


a=1 

lull na.v= 2, (J ID, ul? dv)? +(f |Dgul? dx)”. 

i=1 H H 

Furthermore 

(12) lu; 14(R*; dv)|| $C, (d,p,q, A, M)]llulll,, ray  ueFR(R*; v). 


Proof. It is sufficient to establish 


d 
(13) \|u; 19°(H ; dv)|| Sc, (d, p, A) (x j1D,ulP dv)" ueCo(H), 


i= 1H 
d 1/p 
(14) \u; LO(H;dv)| Sex(d,p.A) (¥, [1DyulPav,c+ fll? dy) ueC\(H), 
i=1H H 


where p<d and g,=dp/(d—p). Indeed, in the same way as [1; Lemma 5.12, 
Corollary 5.13, Lemma 5.14] we can see that (13) and (14) combined with 
HGlder’s inequality conclude (11) and (12) respectively. 

We now assume p<d and ueCj(H). e; denotes a unit vector along the x,- 
axis. Put B=p(d—1)/(d—p) (21) and H;=(a;—h, a;+h). 


2h 
(15) <f | yuce+re)l ldtsB f ul?—"[D,u| dx,. 
0 |dt ii, 


If A, ={%,=(X 1, ---5Xj_ 15 Xj4 49 -++oXqg)i (X4,+--,XQEH for some x,eR'} and G,(x,) 
=sup {|u(y)/"“~: yeH,(x)}, then 


J G(%)*~* H (dx) S J ¥,(dxX) J Blul?~*|D,ul dx; 
A; A; i 


=< '|D; uldvi BUY lul®- YP dv, ny |D, ul? dv;,)'/” 


ptf tr dv,y” (Sf DyulPavy) * 
H i=1H 
where p’=p/(p—1). On the other hand, in case of d>2 Hélder’s inequality gives 


d d-1 
f lultedvsf TG &) v(dx)= J m(dx,) J [] G(X) GAx,) dx, 
H Hi= 


Ha A,i=! 


— 1/s 
<fm m(dx,) 4 Tl Ga) dz,) (j G,(%,)4-! d&,)4-») 
Ala i= 


Ha Ala 
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with s=(d— 1)’ =(d—1)/(d—2). Then Gagliardo’s lemma [7] (cf. [1; Lemma 5.9]) 
leads us to 


é-1 eS es lk aia a 
(j G,(x;)° d,) < Il f G(x)*' a dz, 
A= i= id 
sz 
j G(X)*~* dX ja, 
where Ai,,={%ig=(Xq> «+5 Xj— 15 Xia1s +++» Xa_1): (%4, ---» Xg)€H for some x;,, x,ER'}. 
Therefore using Hélder’s inequality again, we find 


fmt Bie 


ex 1/(d—1) 
flultedvs f max, ( f Gye dz,)([ G,(X,)*~" ash 
H H fla 


a 


d 


IA 


(j m(dx,) ) G,(%)*~* dX,4)" "VJ ay ay" 
Ha 


1 ia Aa 


({ GAk)-' 6,(ak))''¢-®, 


Doe, 


" 
- 


II 
=. 


which holds in case of d=2, too. Thus 


d , d 1/p) 1/(d—1) 
flultedvs [1 {uted (x J |D,ulPdv,) ; 


H H i=1H 


wt 1/p) dd —1) 
<}p4 {tray (> j|D,ulPdv,) } (1/A)'/?"4-2), 
H i=1H 


By cancellation we obtain (13). In order to get (14), it suffices to use [1; Lemma 
5.10, (19)] instead of (15). q.e.d. 

Next we are concerned with the imbedding of ¥’(H;v) into C*(H):=the 
space of Hélder continuous functions on H of exponent 4. This can not be 
expected as long as we do not require the functions of F?(H; v) to satisfy some 
additional restrictions. It is known that all functions u which, together with every 
distributional derivative D‘u of order |k|<s, belongs to L?(H; dx) are imbedded 
into C*(H), where 0<A<s—d/p if (s—1)p<d<sp or 0<A<1 if d=(s—1)p. We 
turn this imbedding into a convenient form for our application carried out in 
the next section. 

Before we begin the proof, we introduce a new notation. If ueL},.(Q; dx) 
satisfies 
J vgdv=—fuD,gdx peCZ(Q) 

2Q 


2 


for some veL},(Q;dyv), then there is a version @ such that a=u ae. d is 
absolutely continuous with respect to m on Q,(x) for §,—a.e. , and the Radon- 
Nikodym derivative coincides with vv-a.e. Thus we denote such (uniquely 


determined, up to sets of v-measure zero) v by D,u. 


Proposition 3. Let ueF(H;v) be such that Di.ueF"(H;v), OS|k|<s and 
D,D*.D,ueL?(H;dv), OS|k|<s—1 with s=[(d—1)/p]+1. Then u belongs to 
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C*(H), where 0<A<s—(d—1)/p or 0<A<1 according to (s—1)p<d—1<sp ord 
—1=(s—1)p. 

Proof. Since Dj D¥*'u= Dit! Diu=D*.,D{ Du ae. (OS|k|+|I|<s, e=0,1) and 
D,D‘. Dyu=D*. D,D,u v-ae. (0S|k| <s—1), it is enough to prove that there is a 
version v which satisfies u=v v-a.e. and 


x, yeH <s 
x¥y = 7 


(16) sup lo(x)— o/x— WV? SCa(d, pW) >» || Dy. Dau; L?(H; dx)|) 
|k| 
Ib 


+ YY |DED,D,u; L?(H;dv)|\}. 
O<|k|<s—1 
Let xeH and let H(o)=H(b, o/2) be any cube such that xe H(c)c H. Put H, 
=(a,—h,a,+h), H' ={x' =(x,,...,x4_,)ER*~': (x,,...,x)€H for some x,ER'} 
etc. Since u, together with D,u, 1 Si<d—1, belongs to L’(H; dv), u(-,x,) can be 
approximated in W*?(H’) (=Sobolev space) by functions smooth on H’ for m- 
a.e. X,€H,. Therefore 


1 "i 
u(x’, X)— aaa J u(z’, x,)dz’ an = J bi |D,u(x' +r, x,)|p*~*dpdédr, 
H(o)’ Si i= 


where we used polar coordinates for z’ with center x’:z'=x'+pé, p=|x’ 
—z’|,€eS4~? (=unit sphere in R4~') and 2={(p, é,r):x’+péeH(o), O<r<p}. 
Then by 2c {(p, ,r):0S ps d—1a, x'+réeH(o)} and by Proposition 1 —(ii) 


, 


— 


H(o)' i=1 


for v-a.e. xEH. Set 


r=(d—1)p/{(d—1)—(s—1)p} if (s§—1)p<d—1<sp, 
p<r<0,0<1-—(d—-1)/r(<1)_ if d—1=(s—1)p. 


H6lder’s inequality and the Sobolev imbedding theorem give 


I ' 
u(x’, x4)— i - ag ,X4)dz 


<c,(d,r,hyo'-4@-" YF ( f [DEDL u(y’, yi'dy’)!" at 


7 0,1 Ha H(o)' 
1 


Sc,(d,s, p,r, hyo! - 4-2" ¥ ) J |DE, D5 Di u(y’, t)\Pdy’)' "dt 
Sc,(d, S, p,T, h)o' — 


On the other hand, we have a version w such that w=D,u ae., D,;w=D,D,u v- 
a.e. and for ¥,-a.e. x’ i 
J D,w(x', s)m(ds) if t<x,, 
, , [t, xa) 
w(x ,xX,)—w(x ,t)= x F 
tei £ J D,w(x',s)m(ds) if t2x,. 


[xa, t) 
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Therefore for the same reason as above we get 


. , , 
WHC) hea) Me zal v(d2) 


. , 
HO er 


Sa; 5 f (5 J Dyula’ olde + § [DyD,u(e nlmae)) dz 


H(a)' 


o-oo) fC § [Dyu(z’, tl" dz’)'"at 


ha, H(e)’ 


+f ( f |D,Dyu(z’, oraz" mao 


Ha H(c)’ 
‘Kteaaeeoe + eat |D*, D,u(z’, t)|dz')'/? dt 
O<s|ki|ss— 1 Ha H 
+ J (f |Dt.D,D,u(z’, t)/Pdz’)'’? m(dt)} 
Ha H' 
Scg(d,s,p,r,h)o'—4@-" =P (|| D’. Du; L?(H; dx)|| 
Os|klss-1 


+ ||D*, D,D,u; L?(H; dv)\\), 


for m-a.e. x,. Hence 


, 1 , 
|u(x *)— TH) we? » Z4) v(dz)| 


<c,(d,s,p,r,hyo'—*-9"( DE Dgu; L(H; dx)|| 
osliiss 
Fp 

+ )  |\D*.D,D,u;LP(H;dv)\\), for v-ae. x. 


O<|k|<s—1 


For v-a.e. x, ye H having o=|x—y| we can take a cube H(a)=H(b, o/2) such that 
x, yEH(c) CH. Thus (16) follows. q.e.d. 


Dual Space. We shall turn our attention to the dual space of A(R‘; v), which is 
denoted by F~" (R‘;v) or simply F~-” with p’'=p/(p—1). Let .? and B? be 
linear subspaces of distributions such that: T,€.7/? and T,¢2? are expressed as 


(17) <T,, >= | frgdvt ff, dx, 
Ra Ra 

(18) <T,, >= J (8, 9+82D,9)dx, 
Ra 


for every weCX(R*) and for some f,eL?(R*;dv)nL?(R‘;dv) and f,, g;. 
g,€L?(R*;dx)AL?(R‘; dx). Obviously Sf’ UB’ < F ~*nF ~?. 


Proposition 4. (i) If an element Te.” is given as (17) and Supp[f,] is compact, 
then T belongs to &?. (ii) For any TeB?’ and any e>0 there are ECF (R*) and 





A Construction of Diffusion Processes with Singular Product Measures 
h,eL?(R*; dx) L?(R‘; dx), i=1, 2 such that \|h;||,<e, \\h;||,<€ and 


(19) <T, g>— | Gpdx=f (hyot+h,Dyg)dx geFOnFZ, 
Ra Ra 


where || - ||, stands for the norm of L?(R‘; dx). (iii) Suppose that an element Te.of” 
is expressed as (17) in terms of functions f,, i=1, 2 such that both Supp [f,]’s are 
contained in a closed ball &. Then for given open ball B with BOX=¢ a function ® 
in (ii) can be chosen to be ®=0 on B. 


Proof. Assume the condition of (i). Let H(h)=H(0,h) be a cube containing 
Supp[/,] and put 


f Sf, (x, t)m(dt) — if (x’,x,)€EH(2h), 


F x',x — J (a, 2h) : 
(%0) ‘0 otherwise. 


We choose an «€C??((—2h, 2h)) satisfying «=1 on [—h,h], and find 
ff, gdv=[FD,(ap)dx  peCZ(R’). 


Since F belongs to L?(R*;dx)AL?(R*; dx) we get the assertion of (i). 

Part (ii) is obvious. In fact, for any TeY? and for any ¢>0 there are functions 
W,¢ Cp (R*), i=1, 2 such that ||g;—w;\|,<¢ and ||g;—;||,<¢, where g;, i=1, 2 are 
functions expressing T. Setting =, —D,,,h,=g,—W,, h,=g2.—W, leads us 
to (19). 

Let us show the statement (iii). By b=(b,,...,b,) and o=(0,, ...,04) denote 
the centers of B and & respectively. Selecting a cube H(h) such that Yc H(h), we 
define the function 


ff, Om(dt) if (x’, x,)€H(2h), o,Sb,, 
(xa, 2h) 
G(x’, x=) — j f(x’, t)m(dt) — if (x', x je H(2h),0,>b,, 
(—2h, xa) 
0 otherwise. 


Repeat the above arguments replacing Fg, and g, by G,f,+a’G and aG 
respectively. Since G=0 on B, we obtain conclusion. q.e.d. 


2. Diffusion Processes on R@ 


Let m be the same measure as in the previous section, that is, m is a nonnegative 
measure on R'! satisfying (6). v,j8 are defined by (2). a;;, i,j=1,...,d satisfy the 
following: 

(20) For each i,ja;,=a,, and a;; is of C%°-class on R*. 

(21) There is a constant y(2 1) such that 


d d d 
Y a i vuldx)s i E; €; a; (x) vsldx) Sy D EP v(dx) x, eR’. 





60 M. Tomisaki 


Abbreviating F(R‘; v) as F, we now define the form # by 


9[Bl=F 
d 


Biu,v)= Y § D,u(x)D,v(x)a,;(x) v,;(dx), 


i,j=1R4 


(22) 


where derivatives D,u,1<i<d—1 are interpreted in the weak sense. For 4>0 
put B,(, )\=B(, )+A(, ), (, ) being the inner product of L?(R‘;dx). It 
follows from (10), (20) and (21) that @ is a Dirichlet form on L?(R*; dx). Namely, 
(i) F is a dense subspace of L?(R‘; dx), (ii) F is a Hilbert space equipped with 
the inner product #,( , ), (iii) @ is symmetric and nonnegative definite, and (iv) 
if ueF, then v=(Ovu)AleF¥ and B(v, vl) SB(u,u), where a v b=max {a,b} and 
aAb=min {a, 5}. Particularly, in this case we notice that: Let « be a function on 
R' having the properties «(0)=0 and 0<a(t)—a(s)<t—s if s<t. If ueF, then 
a(ujeF and 


(23) B(a(u), «(u)) S B(a(u), u) < Blu, u). 

Let T be an element of F~? and u be a solution of the equation 
uceF 
Bu,9)=<T.9> geF. 


For each 4>0 we define the operator G, from ¥~? into F by setting u=G, T. 
The Riesz representation theorem asserts that G, T is determined uniquely by 
A>0O and TeF~*. {G,:A>0} is called the resolvent associated with &. It is 
evident by the definition that {G,: A>0} satisfies the resolvent equation: 


(24) 


(25) G,-G,+(—n)G,G,=0 in>0. 


With the aid of results due to J. Deny [3], we see that AG, is subMarkov for 
each A>0: 


(26) O<AG,f<1 whenever O<f<1 and feL?(R‘;dx). 
Moreover the resolvent fulfils the following: 
Proposition 5. G,(C,) is a dense subset of C,, for each A>0. 


For the proof we need auxiliary lemmas. Noting (10), (12), (21) and (23), we 
can easily obtain the following lemma in the same way as [14; Theorem 4.2] (cf. 
[16; Theorem 4.2]). 


Lemma 6. If p>d and TeB?, then 


(27) |G, T ||. C47, 4,4, p, A, M) (Ilgilla+ ligillp), 


i=1,2 


where g;,i=1,2 are expressing T as (18). 


Next we turn to the Hélder continuity of G,T. Following L. Nirenberg’s 
argument [13], we have already gotten in [16] the result corresponding to m, 
=46-measure on a bounded cube. In the present case we can still utilize his idea. 
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Lemma 7. Let Te.* be represented as (17) by means of f,,i=1,2 such that 
Dt‘. f,eL7(R‘;dv) and D*, f,¢ L?(R*;dx) for 0<|k|<d )—1. Then G,T is Hélder 
continuous on R¢ with exponent 1:0<1<1/2 if d is even or 0<I<1 if d is odd. 
Proof. Let H=H(a, 1). Set u=G, T. For any r(l1Srsd—1) and geF we denote 
by A,@ the difference quotient: 4, o(x)=(o(x +he,)—@(x))/h, where heR' and 
e, is a unit vector along the x,-axis. B,(u,«4,9)=<T,«4,o> for every peF 
and «€ C5 (H), whence 


d 


(28) BA _,(«u), p)= —<T,«4,9>+ z J {(D,u)(D;2)(4,9)4;; 
H 


i,j=1 


—(D;,«)u(D; 4,9) 4,;—D,(au)(D,9)(- +he,) 4,4; ;} dy;;, 


where D; means D; for 1Si<d—1. On the other hand, in the similar way to 
Sobolev spaces one can easily show 


(29) f (4,9) dvs j (D,o)?dv, and 
R4 Ré 


(30) lim { (4,9—D,9)?dv=0. 


h-—0O Ra 
Combining (29) with (20), we derive from (28) that 
|B ,(4_,(au), P| Seillou.. Fo (Hv), 


c, being independent of g and h. Taking p=4_,(«u)(€EA%,(H; v)) with suf- 
ficiently small h, we find by (21) 


(y~! AA)\A _Mauls nw yS BA _,(au), A_,(@u))Sc,|4 _nl(@U)|> Hy 


from which {4_,(au)} is a bounded set of A?2(H; v). In cosideration of (30) we 
apply the Banach - Saks theorem to obtain a subsequence {4_, (au)} whose 
Cesaro sums converge to D,(xu) in F2(H; v). Repeating this argument we have 


(31) D'(au)eF%2(H;v) OS|k|<dy. 


Now &,(u,«D*.~)=<T,«D*.g> for every peCZ(H) and «eCH(H). Since 
D5 D*.u=D*. Diu ae. (e=0, 1), an integration by parts gives 


J D*.(a,,D,(au)) D, pdx =| Dt. (af,) pdv+ j Dt.(af,) pdx —A{ D*. (au) pdx 


d-1 d-1 
+ ¥ JD’(«D,(a,,D,u))odv+ ¥ [DE (aD ,(a,;D,u)) pdx 
j=1 


i,j=1 


—| D*.D,a,,uD,%) pdx —| Di(a,,D,uD,%) pdx, OS\k|Sd,—-1. 
Therefore we get by Riesz’s representation theorem 


J Dy (iq D,(au)) D,pdx=| Fodv 
H H 
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for some Fel?(H;dv) and any @eC(H), which implies D,D*. 
-(a44D,(au))e2(H; dv), and so D,D*. D,(au)eL?(H; dv),0<|k|Sd,—1 because 
44 iS positive and smooth. 
We have just shown that au satisfies the conditions of Proposition 3. Hence 
aue C'(H). Since « and H are arbitrary, we thus obtain the conclusion. q.e.d. 
We are finally in a position to show 


Lemma 8. G,(4’)< C,, for each p>d and A>0. 


Proof. Let Tbe any element of Z’. By Proposition 4-(ii) and Lemmas 6 and 7 we 
have a sequence {®,}< C%(R*) such that G,®, converges uniformly on R‘ to 
G, T. To complete the proof it suffices to show G,(Cf(R*))< C,,. Let us fix A>0 
and ®e€C%(R*) and put u=G,®. Choose a unit cube H=H(a, 1/2) such that 
HoSupp[®]=¢. Then 4,(u, g)=9 for all pe Cf (H). By using (11) and (21) we 
can repeat the arguments in [14; Theorem 5.1] (cf. [16; Theorem 4.3]) to get 
sup {|u(x)|: xeH(a, 1/4)} <c(y,d, A)||u; L(H; dv). Thus we obtain the con- 
clusion in the same way as [16; Proposition 5.1]. q.e.d. 
We now proceed to a 


Proof of Proposition 5. We use an idea of Kunita [12]. Fix any p with p>d. Let 
us define the operator L from C?(R*) into &? ZB? as follows: For each 
ueCe(R*) 


d 
(32) (Lu, p> = d J PilayDiM oar, peCg (R’). 
i, j= 

What Lu belongs to ’ 8? is derived from the fact: D,(a,;Dju)ECo, 1 Si,j<d 
and Proposition 4-(i). Furthermore we set T=(A — L) u(E.’ NB”), where A(>0) is 
arbitrarily fixed. Then by Lemma 8 G,(@’) is a dense subset of C,,. An 
application of Proposition 4-(ii) and Lemma 6 finishes the proof. q.e.d. 

By (25), (26) and Proposition 5 we can appeal to the Hille- Yoshida theorem 


to get a nonnegative, strongly continuous and subMarkov semigroup {T,:t>0} 
on C,, such that 


Gf (x)= f en* T,f(x)dt feC,,,A>0. 
0 


Therefore we are also given a unique associated Hunt process X =[x(t), ¢, P,] on 
R¢. 

Let us show the continuity of the sample paths following Kanda’s arguments 
[11]. Let B be an arbitrary open ball and Q be another open ball with QNB=¢@. 
Then there is a function ueC%(R‘) satisfying u>0 in Q and u=O0 outside Q. 
Considering Lu defined by (32) with such u, we set T=(A—L)u, where A>0 is 
fixed arbitrarily. Notice u=G, T. Since T fulfils the conditions in Proposition 4- 
(iii) replaced by Q instead of 2, by Lemma6 we have functions @,,n: 
=1,2,...eC3(R*) such that their supports are contained in B and G,®, 
converges uniformly to u. Put t=inf{t>0:x,¢B}. Then by Dynkin’s formula 
G,®,(x)=E,[e~"G,®,(x,)] (xeB). Making n> 00, we get u(x)=E,[e~*u(x,)] 
=O(xeB), that is, PR(x,¢eQ)=O0(xeB). Since Q is arbitrary, we are led to 
P.(x,€0B)=1, which means the continuity of sample paths of X. 
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Finally we arrive at 


Theorem 9. For the form @ defined by (22) there exists a unique diffusion process 
X =[x(t), ¢, P.] on R¢ such that 


4 
G,f(x)=E,[fe-“f(x, dt] fec,,A>0. 
0 


3. Proof of the Theorem 


Closability of 2. Let {u,}< C9(Q) be a sequence such that 2(u,,—u,,u,,—U,)>0 
as m,n—oo and (u,,U,)9—70 as n— oo. It is sufficient to verify that 2(u,,u,)—0 as 
n— oo. First we show that the 


n>ooi=1 


d 
(33) lim } [(D,(au,))?dv,,=0  aeCZ(Q). 
2 


For any «€C®(Q) there is a closed cube H = {xeER*:1;<x;<r,, 1 Si<d} such that 
Supp[a]<H, inf {mo(t):|,St<r,}>0 and m([l,,r,])< 0. Then in the same way 
as (9) 


(34) J (au,,—au,)? dv Sc, {f (au,,—au,)?dx+ f (Dy(au,,—au,))? dx} 
2 H H 
for some c,, and by (3) 


Sy (Un — Uns Um — Unda + 2(Um — Uns Um — U,)} 70 as m,n co, 


for an appropriate c,. Therefore by using (3) again, we have 


d 
lim y f (D(au,,—au,))?dv,,52 lim Y [(«D,(u,,—u,))? dv;; 


m,n>oj={12 m,n-ooj=1Q 
35 
7 Sc; lim Qu,,—u,,u,,—uU,)=9, 
with some c, and for every «€C>(Q). From now on « is fixed arbitrarily. Since 
{D;(au,)} is Cauchy sequence in L?(Q; dx) for each i, and since (au,, «u,) 0 as 
n— oo, the closability of Dirichlet integrals ensures 


(36) lim | (D,(au,))? dx =0. 


n—-cooQ 
Hence by the same method as (34), we obtain 


d-1 d-1 


Y f Dau,) Dau, dv|=| ¥ § D?(au,)au,,dv 


i=12 i=1Q 


d— 1/2 
Sea} . J (DF usar} {{¢ ty)? dx-+ § (Dg) ax} 


1/2 
i=1 


—-0 asm—-o, 
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for each n and for some c,. Combining this with (35) and (36), we get (33). 
Indeed, 


Fs 
lim y J (Dia u,))° dv,;= lim 3 J (Da u,))? dv 


n-Oi=12 n>00 i=12 


d-1 =a 
< lim imsup} 5 J (D(au,—au,,))? dv +2 Di J Dau,) D(au,,) dv } 
i=12 


noo m-—0o i=1Q2 


=0. 


Put Q, ={xeQ: |x| <1, dist(x, dQ)>1} and Q,={xeEQ: |x| <l, dist(x, dQ) > 1/]} 
—Q,_ (122). Then by using (33) we find 


0<IUu,, u, => < tide, 


“a jana 
1=1i1,j=12) 


< y oa , 


l=1 m-o@ j 


j D;(u, — Up) D(u,- U,,) a;,dv 
2 


12 


425 { Diu, Dj yaar} 


a frm 
i,j=12) 


00 d 
<¥ limint| § Diu, tty) Dy ty) 44 


l=1 moo Lj,j212, 


d 1/2 ( a 1/2 
+2140)1 5 [Diu ar.) » j ing)?aruh 


i=12) i=1Q) 


a) d 
Sliminf Y YJ D\u,—u,,) Du, —U,) 4;;4;; 
m>o l=1i,j=1 2) 


=liminf Qu, —U,,, Uy, — Up), 


m-— oo 


whence lim Au,,u,,)=0. 


n—©o 


A Diffusion Process on Q. We divide the proof into two steps. 


Step 1. Assume that we have a domain Q! such that Q<Q', m is defined on Q}, 
inf {mo(t): teQ>} >0 and m(Q})< oo. Moreover every a;; is of C“°-class on Q' and 
there is a constant (= 1) such that 


d d 
y' > &v(dx)s 5 E,€;a;(x) v;,(dx) Sy Le 2vi(dx) xeQ', EER’ 
ial i,j=1 i= 


Under these assumptions the form &, the smallest closed extension of 2, is given 
as 


DE] =F5(Q; v) 


d 
6(u,v)= Yo J D,uD,va,,dv,; 


i,j=12 


Here derivatives D;, 1 Sis<d—1 are taken in the weak sense. 
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We shall extend this form as follows: Put m=m on Qj, or =the one- 
dimensional Lebesgue measure outside Q) and set *(dx,...dx,) 
=dx,...dx,_, m(dx,). Define ¥,,, 1 Si,j<d on R* by (2) replaced by ¥ ewer of 
v. By the above assumptions we have also 4,,, 1 Si,j $d on R* such that G;; 
on {2 and they possess the properties (20) and (21) with d,; and ,; in place of a, 
and v,; respectively (see [16; Lemma 5.6]). Let us introduce the form é: 


Bb] = Fo (R*; ¥) 


d 
&(u, a ds, J Pind, 2b y ay 
where D;=D, for 1 Si<d—1. This may be considered an extension of &. 
Since é fulfills the conditions in the Previous : section, Theorem 9 asserts that 
there exists a unique diffusion process X =[X(t), 2, P.] (xe¢R*) whose resolvent R, 
defines a operator on [?(R‘; dx) and satisfies the equation 


R, feD[é] 
AR,LO=(£9) pele] 
for every feL?(R*; dx) and A>0. . 

Let ¢=inf{¢>0: €(1)¢Q} and set X,=[X(t 0 2), z, P.] (xeQ), which is called a 
part of X on Q. With the aid of a remarkable result due to M. Fikushima [6; 
Theorem 4.4.2] we find that if {Ry,: 4>0} denotes the resolvent operator of Re 
then Ry, f is a solution of 
ue FQ; v) 
8,UM=(L0)2 vEeF5(Q;¥) 
where fel?(Q; dx) and 1>0. Since {=v on Q, &=€& on F2(Q; v) and a solution 


of (37) is unique according to the Riesz representation theorem, X, is none 
other than the desired process. 


(37) 


Step 2. We turn to the general situation of the theorem. Let {Q,} be an 
increasing sequence of bounded subdomains of Q such that |_) 2,=Q and each 


closure Q, is contained in Q,,,. For each n we define the form 6, by 


DE, = F2(Q,;v) 


d 
é,(u, v)= Y J D,uD,va,;dv;;, 


i,j=1Qn 


where D;=D, for 1Si<d—1. Then Step 1 says that for each n there is a 
diffusion X, on 2,, which is uniquely associated with &,. Considering the 
method of construction of X,,, and using Fukushima’s result again, we see that a 
part of X,,, on Q, is equivalent to X,. Therefore we invoke [2] to get a unique 
diffusion process X on Q such that part of X on Q, is equivalent to X,, for all n. 
Of course X must be associated with &. Since the property (i) of the resolvent R, 
follows from the proof of Lemma 7, the proof of the theorem is 
completed. q.e.d. 
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4. Polar Sets 


In this section we are concerned with polar sets of our diffusion process. 
Given two data {a‘})}, {a{7)} satisfying the conditions (2) and (3), let X°, X 
be the corresponding diffusions of Theorem. Then we have 


Theorem 10. A set NCQ is polar with respect to X“ if and only if so it is with 
respect to X?, 


Proof. In the case of Theorem 9 or Step 1 of the proof of the Theorem we have a 
resolvent density by Lemma 6, so that a set is polar if and only if its capacity is 
zero [5; Theorem 4]. But we see trivially that capacities corresponding to the 
given two data are equivalent. Hence we get the conclusion. In the general case 
there is a sequence {Q,} of bounded sets such that X‘, a part of X on Q,, 
corresponds to Step 1 of the proof of Theorem for i=1,2. In order that N is 
polar with respect to X, it is necessary and sufficient that NAQ, is polar with 
respect to X“” for every n satisfying NAQ,+¢. Therefore the desired assertion 
follows from the former case. q.e.d. 

Since F3(Q; v)< W)?(Q) (=Sobolev space) under condition (6) [6; Example 
4.4.1] guarantees the following 


Corollary 11. Any polar set of our diffusion is also polar with respect to the 
Brownian motion. 


It is now our intention to show that the converse is not necessarily true. 
Namely that it is possible that our diffusion with a (d—2)-dimensional plane is 
not polar. To do this we need to study the structure of a Dirichlet space which is 
associated with a direct product motion of one-dimensional diffusions. 

Let X°=[x, P], i=1, ..., d—1 be independent one-dimensional Brownian 
motions on R' and X =[x, P] be a conservative one-dimensional diffusion 


on R' with the generator ities” qa’ Here n is a bounded Borel measure on R'! 
n da 


such that 


0 0 0 0 
J dafn(db)= +00, J n(da){db<o, 


, da|n(db)= + «, J n(da) { db< 00. 
0 0 0 


0 


+00 


These conditions mean that the boundaries + oo are entrance, but not exit [10; 
§4.6]. X has transition probability densities p(t,a,b) such that 


0Sp(t,a,b)= p(t, b, a) 
is continuous on (0, + 00) x (— 00, + 0) x (— 00, + 0), 


0 
3 Pl b)=G™ p(t, a, b), 





p3 (t, — 00,b)=pz(t, + 00,b)=pFz(t,a, — 00)=p;(t,a, + 00)=0, 
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where G is applied to a or to b and p}(t,a, —— {p(t,a+h, b)— p(t, a, b)}/h, 
P3 (t,a, b) _ {p(t, a, b)— p(t,a,b—h)}/h etc. (see [10: 84, 11). 
Put 


d 
X°= [=o 9 he = TT | x=m(x;,..., xJER*. 
i=1 
We denote by (F°, €°) the Dirichlet space associated with X°. As can easily be 
seen, in the same way as [8; Example 4.1], (F°, &°) is related to I2(R‘; dv°) and 


(40) CR(RYCF?, 
(41) &°(u, n=5 (Ss yr J D,uD;vdv°+ ! D,uD,vdx) u, ve CP (R*), 
i=1 R4@ 
where v°(dx, ...dx4)=dx, ...dx4_,n(dx,). Moreover we find that: 
Lemma 12. (F°, &°) is regular, that is, C%(R*) is dense in F°. 
Proof. Let R° [resp. R\] denote the resolvent of X° [resp. X] and introduce a 


new function space: 


I 
-{5 P(x’) RY Wi(xq): PE CO(R*~"), Wie CO(R"), 12 i. 


i=1 


Since 
lal, ea, yo/2(d + 1) SP (u,u)S |llulll ga,yo, ueCg(R*) 
and R°(#) is a dense subspace of F°, it is sufficient to verify 
(42) RY(H)< Fo (R"; v°). 
In fact (40) and (41) assert 


F3(R‘; v)\coF?, 


d-1 
EP u, w=; ( YJ (D,u)? dv?+ f (Dyu)*dx) + | u2dv® ueFZ(R4; v°); 
Ra Ra 


i=1 R4 


on the other hand it follows from (42) that F¥°- F?(R‘; v°), concluding F° 
= F2(R*; v°). 
Fix any u(x’,x) =@(x’) R@ W(x,)e# and set v=R™ p. Then 


RY u(x) =RY u(x’, x)= J glxa.y', ya) G(x’ +y’) (ya dy’ n(dy,), 
Ra 


g(xay, w= fer (2nt)- (4-1/2 @-Wy'17/2t p(y, x, y,) dt. 


Therefore for 1 Si<d—1 


(D; RY u(x)? =(f (xa, ¥' ya) Di O(' +) v(y) dy n(dy,))” 
Ra 


< a(X4,y', Va) (D; O(x’ + y/) v(yy))? dy’ n(dy,), 
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and so 
||D,(R} u); 2(R*; dv°)||? 
< Bi (g(x)? dx’ J v(y4)” n(dy,) FE 8(X4,¥', Ya) n(dx,) dy’ 
SD, 9; P(R*~*; dx’)|? |v; E(R*; dn)|}? < co. 
By (39) we obtain the following in distribution sense: 


D,R{u(x)= f g(x’ +y’) v(y,)dy'n(dy,) J e~'(2nt)-4- 9? 
Ra Hl 
-e-W'P2tD dit, X4,V,) at. 
By using (39) once more, we have also 


+ D,p(t,a,b)v(b)n(db)= |  n(db) T t.b,.G® v(c)n(dc), or 


(—oo, a] - 


=— f n(db) rs p(t, b,c) G v(c) n(dc). 


[a, +00) - 


Since |G vy] =|R@ wy —W| <2sup|yI, 


(DARiux))P?< fe’ +yP dy’ f ent) e-bay 
R¢@-1 0 


x 4suplyl? {m((— 00, x4]) 1, , oa) + (LX 4, + 2) Iho, + oo) %a)}?. 


Hence 


|D4(R} u); L-(R*; dx)|\? 


0 
<4sup|y|? lg; L(R4~*; dx’)\|?n((— 00, + ~)} J n((- 2, x,)dx, 


+ [ n([x4, + ~))dxyh 
0 
<00. 


By the same method 


d-1 
||D,D,(R¢ u); 2-(R4; dx)||? < 0. 
1 


Thus Ru satisfies the conditions of Proposition 1-(i), which leads us to 
Ro ue FZ(R!; v°). q.e.d. 


Let k be an integer satisfying OSk<d. k=(k,,...,x,) denotes a k-tuple of 
integers satisfying 1<x,<...<x,<d. Given a=(d,,...,a,)EQ, let E,(a) denote 
the k-dimensional plane in R*, which contains a and is parallel to the plane 
spanned by coordinate axes corresponding to Xqyo eres Xq, E,(a)={xER*: x; =a; if 





A Construction of Diffusion Processes with Singular Product 69 


i¢x}. In particular E,(a)={a} if k=0. Let X=[x,,¢, P.] (xeEQ) be a diffusion 
process of the Theorem. 


Theorem 13. For E,(a)Q to be polar it is necessary and sufficient that (i) k<d 

—3, or (ii) k=d—2 and xad, or (iii) k=d—2, x$d and j jimias—s0, 
-—1/2 0+ \O 

+ spas} dt=o. 


Proof. It is enough to provide a proof in the case of Theorem 9. Given a point a, 
put H=H(a,1). Introduce a new positive Radon measure n on R’, which 
coincides with m on (a,—1, a,+1) and satisfies (38). Let us consider the above 
mentioned direct product diffusion X° =[x?, P°](xeR*) corresponding to this n. 
Let X,, and X}, denote parts of X and X° on H, respectively. Noting Lemma 12 
and [6; Theorem 4.4.2] we find that both Dirichlet spaces associated with X, 
and X}, are regular. Furthermore we notice that X and X° have resolvent 
densities. Combining (10), [5; Theorem 4] and [6; Theorem 4.4.2] with these 
facts, we find that E,(a) is polar if and only if the capacity of E,(a)\H with 
respect to X,, (and hence X}) is zero; moreover equivalently the capacity of 
E,(a) with respect to X° is zero. Thus these statements are equivalent to P?(¢?. 4) 
<0)=0, xeR*, where of ,.=inf{t>0: xPeE,(a)}. It is evident that P(e? ,,) 
<o)=1, xeR* if k>d—1, and P°(o?, ,, <0)=0, XeER* if k<d—3 or if k=d 
—2 and xsd. Therefore we may assume that k=d—2 and «$d, which reduces to 
the case that d=2 and k=0. However the necessary and sufficient condition in 


t 
order to P°(o),<0o)=0, xeR? is that the integral j Jfmias—ssa 


0+ LO 


—1/2 
+s)) ash dt diverges, as we have already seen in [15; Theorem 5.1]. q.e.d. 


Example 14. Let 0€Qc¢R*. Assume that m((—t,t))=const.xt® as t|0 with 
0<f<1. Then for 1Si<d—1 and «eR' the plane S={xeEQ:x,;=a, x,=0} 
cannot be polar with respect to our diffusion as long as it is not empty. But it is 
polar with respect to the Brownian motion. 


Acknowledgement. The author would like to thank M. Fukushima who suggested this problem. 
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Loi de Pindice du lacet Brownien, 
et distribution de Hartman-Watson 


Marc Yor 


Laboratoire de Calcul des Probabilités, Université Pierre et Marie Curie, 4, Place Jussieu, 
F-75230 Paris Cedex 05 


1. Introduction 


Dans tout cet article, t désigne un nombre positif fixé, z, est un nombre 
complexe, non nul, et a=|Zo|. 

Soit IP_, la loi du mouvement brownien complexe (Z,=X,+iY,),. issu de 
Zo> définie sur Pespace Qe=C(R,,C), muni de sa tribu borélienne 3 (pour la 
topologie de la convergence uniforme sur tout compact de R ,). On note encore, 
pour tout z=x+iyeC, IP2, la loi du «pont brownien complexe», entre 0 et t, 
issu de z, en t=0, et aboutissant en z, au temps t=t. Rappelons que: 


dxd y _& ——_ (y-yo)? 
(27 1) © 


(1.1) R13. 


ou 3,=0{Z,,0Ss<t}, égalité que on peut réécrire en: 
(1.1) pour tout Ae3,, P,,(A|Z,=2)=BP,(A) 


(tous les rappels nécessaires, concernant le pont brownien- réel ou complexe- 
sont faits au paragraphe 2). 

Paul Lévy a montré que le point 0 nest pas visité par le processus (Z,),. 9, 
P,, p.s. On déduit donc de (1.1), ou (1.1'), que: 


(1.2) dxdy ps. Pr (ate[0,t], Z,=0)=0, 
et, en fait, on montre au paragraphe 2, plus précisément, que: 
(1.2’) pour tout z+0, Ie (ate[0,t],Z,=0)=0. 


On peut donc définir, par continuité, pour tout te[0,t], Paccroissement de 
largument entre 0 et t, soit: 0,(w)—99(@), de la courbe (Z,,(w), ue[0, t]), et ceci 
peut étre fait, d’aprés ce qui précéde IPF p.s., pour tout z+0 (et donc P,, p.s.!). 

Lun des buts de cet article est de calculer la loi de (0,—6 ), pour IP pour 


Zo? 
déf 


tout z+0, et en particulier pour z=Z,, auquel cas, sous IP* = IP&°, le processus 


0044-3719/80/0053/0071/$05.00 
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(Z)o<:<, décrit une boucle, que lon appelle lacet brownien, et lon a: 
(0,—0,)€2nZ, IP* p.s. 
Par ailleurs, on rappelle, au paragraphe 3, que (0,— 0) est une IP,, martingale 
t 


locale continue, dont le processus croissant est égal a iF Z 
0 s 
meneés ci-dessous permettent en fait d’obtenir, de fagon explicite, la loi conjointe 


de (0,—66; 


et les calculs 


«as 
lize 
Une étape importante du travail est constituée par l’obtention de la loi, sous 
P,,, de (0,—0 9), conditionnellement a |Z,|, loi qui est, précis¢ment, donnée par 
la formule suivante, valable pour tout veR: Ivy (°°) 
ete 


16%) 
\c 
ou I, désigne la fonction de Bessel modifiée, du premier ordre, et d’indice v. 
Ce résultat donne une interprétation probabiliste des distributions p, et 


n,, définies pour tout r>0, respectivement par leur transformée de Fourier (resp. 
Laplace): 


) sous IPF (z +0) (voir le paragraphe 6). 


(1.3) E, (er ~% ||Z,|=p)= 


> 


Iu) lyn) 
v To et A io 


u(resp: y,) est désignée, dans la littérature, comme la distribution développée de 
von Mises - en anglais: «unwrapped von Mises distribution» -, resp.: distribu- 
tion de Hartman-Watson (cf. [4]; 1974). 

Les mesures pl, et y, sont absolument continues par rapport a la mesure de 
Lebesgue (respectivement sur R et R,); leurs densités sont étudiées au 
paragraphe 5. 

La méthode qui nous a conduit a la formule (1.3) permet également - c’est 
la le second aspect de ce travail - de redémontrer (cf., paragraphe 4) des résultats 
plus récents de Hartman ([3]; 1976), 4 savoir: 

(1.4) pour tout couple (r, R) vérifiant: 0<r<R<o, les fonctions suivantes, en 
AeR,: 
Tyr) TolR), KyxlR) Kol) 
I(r) Tyz(R) ~— Ko(R) ” Kyax(?) 


sont les transformées de Laplace de probabilités sur R ,. 

Le paragraphe 6 est consacré a expliciter la loi de indice du lacet brownien 
défini plus haut, tandis qu’au paragraphe 7, on développe, en dimension 
supérieure a 2, certains des résultats obtenus précédemment en dimension 2. 





2. Quelques précisions sur les ponts browniens réel et complexe 


Dans le seul but de simplifier les expressions qui interviennent dans ce paragra- 
phe, on suppose le temps t égal 4 1; les arguments utilisés sont également 
valables lorsque t est un réel strictement positif. 
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Soit, sur un espace de probabilité (Q, 2, P), un mouvement brownien réel 
(B,),>9, issu de X9. On note (%,),>9 sa filtration naturelle, et 


saci 0 Bessy v o{B;}) 


K. Ito ((5]; voir aussi [7]) a remarqué que le processus (B,),-, est une 
(8,) quasi-martingale, dont la décomposition canonique s’écrit: 


(2.1) B= 8,45 21-82 as (t<1). 
0 l-—s 


Soulignons que, d’une part: 
(2.2) (B),< est un (B,) mouvement brownien, issu de x, et, en conséquence: 
(2.3) (B,,<, est indépendant de (B,); 
d’autre part: 

; |B, —B,| 


(2.4) Ej 


——-ds<oo. 
0 (1-s) 


Par ailleurs, si xeIR, on considére l’€quation en U: 
U. 
(Xo, x): ds 


dont lunique solution s’explicite aisément: 


(2.5) Uaxt+B,—(-0) {| OS (t<1). 


De plus, on a le: 


(2.6) Lemme. La solution (U,) de e(xo, x) vérifie: lim U,=x, P pss. 


ttTtl 


Démonstration. D’aprés (2.5), il s'agit de montrer que: 


t 


Bd 
at=9§ 


9 (1—u)? ttt! 0, 


ou encore que: 


B, du 
H,=B,t—(1- as aa 
Or, 
pF —B y) du 
B=t-3)— 
Rappelons le résultat élémentaire suivant ([(7], Lemme 9): si g: [0,1]— R,, est 


une fonction continue décroissante, telle que g(1)=0, alors, pour toute fonction 
1 


f 20 vérifiant | f(u) g(u)du<oo, ona 
0 


lim g(t) j f(u)du=0. 
ttt1 0 
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D’aprés (2.4), ceci s'applique, P(dw) p.s., a g(t)=(1—1), et 


|B, —B,|(@) 
f= (ao? 

Sur lespace Qg=C(IR,,1IR), on considére le processus des projections 
X ,(@)= a(t) (t=0), la famille de tribus 6, =o {X,,s St} (t20), et G=V,G,. 

P., désigne la loi, sur (Qp, G), du mouvement brownien réel, issu de x9. Pour 
tout xelIR, notons P* la distribution, sur (Qk, G,), de la solution (U,),., de 
e(X9, x), prolongée par continuité en t=1 (donc U, =x, d’aprés le lemme (2.6)). 

D’aprés (2.1) et (2.3), on a: 


(2.7) Vapplication: x P* est une désintégration réguliére de P.,, par rapport a 
la tribu engendrée par X ,. 


Pour tout couple (xo, x), on note PB(xp, x) le triplet (Qp,,(X,),<;, Px): on a 
ainsi défini, d’aprés (2.6) et (2.7), le pont brownien issu de Xo, et aboutissant en x 
au temps t=1. 


Remarque. La présentation de PB(x,,x) adoptée ici est semblable a celle de 
Mackevicius ([12]), mais les demonstrations sont différentes. O 


Pour tout couple ((x,x); (Vo, y)) de points de R?, on note z9=x 9+iYo, 
z=x-+tiy, et on définit, sur (Q¢, 3,), PZ,=P5@ P?, (modulo les identifications 
canoniques!): cest, par définition, la distribution du pont brownien complexe 
PB(z, z) issu de Zp, et aboutissant en z, au temps t = 1; d’aprés (2.6) et (2.7), ’égalité 
(1.1) est réalisée, ainsi que: IPF (Z, =z)=1, pour tout z. 

De plus, on a le: 


(2.8) Lemme. Pour tout couple (zo, z)eC*, IPF (A te] 0, 1[, Z,=0)=0. 


Démonstration. 1) (X,) et (Y,) désignent respectivement les parties réelle et 
imaginaire de (Z,). 
Définissons les (3,) temps d’arrét: 





tp READ _ yy)2 
T,(a)=inf {1/Aza jf SAP tO gon! 
0 (1—s) 


Admettons provisoirement que: 
(2.9) Ps, 7, fT1. 


Définissons, en calquant les notations sur celles du début de ce paragraphe, les 
processus: 


t 
(es en 
‘ ae 


X,=lim X, (qui existe IP*, p.s., par définition de PB(xo, x)), 
ttt1 


Os kth 
s) 


et (Y),<; defagon analogue. 
Par définition de Pi, Z=((X,+iY),<,) est un (IP¥,(3,),<;) mouvement 
brownien complexe issu de Zp. 
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D’aprés le théoréme de Girsanov, pour tout n2 1, le processus (Z, , 7,),> 9 est, 
sous la probabilité: 


Ty es "4 — 7, 
a =exp} — f L(x athe WaT a, 
} pe 





un mouvement brownien complexe, issu de z,) en t=0, et arrété en T,. 

Les probabilités Q™ et IP étant équivalentes, on a, d’aprés le théoréme de P. 
Lévy déja mentionné: pour tout zeC, IPF (ate) 0, T,], Z,=0)=0, d’ou le résultat 
cherché, d’aprés (2.9). 

2) Il reste 4 montrer (2.9), qui sera une conséquence immédiate de 
assertion: 


1 oe 2 
(2.10) pour tout (x9, x)elR?, (or ds=00, PX ps. 
; @« 


Or, d’aprés (2.5), on a, pour tout s<1: 


Ri. St WED 
aa" X)+) du. 


Ainsi, puisque (X wus St, sous P*, un mouvement brownien réel issu de Xo, 
(2.10) équivaut a: 


1(s om 
(2.11) fff Ra eM aso P. ps. 


olo ( —u)? 7 
Montrons maintenant (2.10). D’aprés [7] (corollaire 3.2), on a: 
j (X,-X,) 
9 (1s) 
-(x—X,)? ds __ 


Ss. | ———+— Gj ES PS 
ee to <i 


ds=0, P., ps. et donc: 


(2.10) est donc réalisée au moins pour un xeER, et donc, d’aprés l’équivalence de 
(2.10) et (2.11) (qui ne dépend pas de x!), pour tout xeR. O 


Ainsi, si 792 +0, on peut donc définir, comme il a été dit dans l’introduction, par 
continuité, pour te[0,1], Paccroissement de l’argument entre 0 et ft, soit: 
0,(@)—9o(@), de la courbe (Z,(@), ue[0, t]), IPF, p.s. 

Le lemme suivant nous sera utile au paragraphe 6. 


(2.12) Lemme. 1) Soit zyeC, et F: Qg—>R une fonction bornée, continue pour la 
topologie de la convergence compacte, et 3,-mesurable. Alors, l'application: 
z—> IE? (F) est continue. 

2) Soit z)+0, et f: RR une fonction continue, bornée. Alors, l'application: 
z— Ez. Lf(0, —9)] est définie, et continue, sur C\ {0}. 


Démonstration. 1) Rappelons, avec les notations du lemme (2.8), que: 


P 2a 
Z,=2t+Z,—-(1-1) | —"— 
0 U- 


uy?” (t<1) 
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et que Zhe: est, sous IPS , un mouvement brownien complexe issu de zy. Ainsi, 
si on définit la famille d’applications (y,(@)), de R., dans C, par: 

~ Z,(@) du 
zt+Z,(w)—(1-—t) | ———— 
+Z(o)—(1-9 | 


z (t2 1), 


alors, on a: IE: [F(w)]=IE_,[F(y(@))], et la premiére partie du lemme découle 
de ce que, pour tout wEQz,, l’application: C — Q¢ est continue, Q, étant 


[y.(@)]()= _ 


z—y,(@) 


muni de la topologie de la convergence compacte. 
2) Supposons z, +0. Alors, pour tout z+0, l’'application 


Arg:0-+0,(a)—0p(o)sIm4 f a 

@[o, 1) 

est définie sur Pouvert 0={w/Vte[0, 1], w(t)+0} de Q¢, qui est, d’aprés le 
lemme (2.8), de mesure pleine pour IPE; de plus, Arg est continue sur 0. La 


seconde partie du lemme découle donc de la premieére. 


3. Parties radiale et angulaire du mouvement brownien complexe, sous IP, , 


Dorénavant, z, est un nombre complexe non nul. Ce paragraphe est une suite de 
rappels succincts, extraits de [16] (voir aussi Getoor-Sharpe [1]). Une applica- 
tion simple de la formule d’Ito montre que, sous P,|: 


'dZ 1 
(3.1) Z,=Zo exp ({ ‘) , 
o Z, 
ce qui équivaut a: 
dZ 


(3.2) log.,(Z,(@)) —log,(Zo(o)) = f >*, 
0 


s 


ou log,, désigne une détermination continue du logarithme le long de la 
trajectoire Z ,(w). 

Identifions respectivement les parties réelle et imaginaire des deux membres 
de (3.2). Il vient, en notant p,=|Z,|: 


t 
d 
(3.3) log p,—log py = o: 
0 Ps 


(3.4) 0,-0,=f —, 
0 


s 


*  Contrairement a ce qui peut sembler au premier abord, cette formule n’est pas vraie pour toute 


martingale complexe, continue (Z,), qui ne visite pas {0}; en fait, une telle martingale complexe (Z,) 
vérifie (3.1) si, et seulement si, elle est conforme (c'est 4 dire: Z? est également une martingale locale) 
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ou 


‘(X,dX,+ Y,dY,) 
B,=J > 
0 Ps 


x,dY,— Y,dX,) 

Ps 
sont, d’aprés la caractérisation de Paul Lévy, deux mouvements browniens réels 
indépendants, puisque: <B, B>,=<y, y>,=t, et <B, y>,=0. 


D’autre part, une nouvelle application de la formule d’Ito 4 p,=(X? + Y,”)'? 
montre que: 








t 
et nafs 


‘ds 
(3.5) P:=PotB, +4) —. 
0 Ps 


La fonction: x— 1/x étant différentiable sur J0,°[, et (p,) ne s’'annulant pas, on 
déduit aisément de (3.5) que: 


(3.6) les processus (p,) et (B,) ont méme filtration. 
Ainsi, d’aprés (3.4), et Pindépendance de (f,) et (y,), on a: 
(3.7) sous IP 


Zo? 


conditionnellement ad p=(p,,t29), le processus (0,—0,) est une 


‘a 
‘ - P ‘ Ss 
martingale gaussienne, centrée, de processus croissant fs. 
0 p s 


Nota bene: Le résultat précédent peut aussi étre obtenu en exprimant 
if # ’ . . 
=5 (Ss+53) en coordonnées polaires (cf., Ito-McKean [6], p. 270), mais la 
x y 
présentation ci-dessus est cohérente avec l’esprit de l'article, et sera encore utile 
par la suite. 


4. Fonctions, et processus, de Bessel 


Ce paragraphe est consacré a4 la démonstration des résultats (1.4) de Hartman, 
ainsi qu’a l’obtention de la formule (1.3). 

Introduisons tout d’abord quelques notations relatives aux processus de 
Bessel: on appelle processus de Bessel d’indice v(eEIR), et on note BES(v), la 
diffusion 4 valeurs dans [0,00), de générateur infinitésimal: 


id jQv4l d 
tr ini 
(pour une définition plus complete, voir J. Kent [10]). 


On désigne quelquefois d,=2(v+1) comme la «dimension» de BES(v): en 
effet, la partie radiale du mouvement brownien 4 valeurs dans R‘ est le 


(4.1) A 


d 
processus BES 6- ) Dans la suite, ’indice v=0, cCest a dire d,=2, joue un 


role fondamental. 
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Sur lespace Qg, = C(R,,R_,), on note p,(m)=a(t), §,=o{p,,s St} (t=), et 
&=o{p,,s20}. Pour tout ae[0,0), P” désigne la distribution, sur (Qe, , §), de 
BES(v), qui vérifie P [p)=a]=1. 

Remarquons que, d’aprés (4.1), pour tout a>0, et tout v20: 


a ‘ds 
(4.2) B,=p,- oo 
0 Ps 


est un (P, §,) mouvement brownien réel; ce résultat généralise la formule (3.5), 
qui correspond au cas v=0. 

Les résultats suivants, dis 4 J. Kent [10], jouent un réle important dans le 
reste du paragraphe: 


— pour tout v>—1, x>0, t>0, la fonction de transition P’(x,dy) de BES(v) 
admet une densité p}(x, y) qui vérifie: 


ae 1 _x?+y? 
(43) (2) rawy=re 2 1 R2)y ro 


d 
(lorsque ec 1, deIN*, cette égalité figure dans Ito-Mc Kean [6], paragraphe 
27), 


— pour tout b>0, notons t,=inf{t20/p,=b}. Alors, pour tout A>0; 


Lay 24) 
LQ Y(by 2 a) 
ou 2=K, si veR, 0<b<a<o, et 2=/ si v>—1, 0<a<b<oo. (Voir aussi 
Getoor-Sharpe [2], proposition (5.7).) 
La clé des théorémes qui figurent dans ce paragraphe est le 


(4.4) E\[e~**]= (: yy 


(4.5) Lemme. Soient a un réel strictement positif fixé, et T un (§,..) temps @arrét 
borné. Alors, pour tout couple (u,v) @indices positifs ou nuls, on a, pour toute 
variable positive Y, §,+-mesurable: 


eS 4 p = we Tds -v 
46 Bz | Yex 1-5 (=) ‘|- E;[Yex (-4 ) (ez r) | 
16) Belvo LHC: (3h 
Démonstration. 1) Démontrons tout d’abord (4.6) pour n=0, et v>0. (4.6) 
équivaut alors, 4 l’évidence, a l’identité: 


° alr (2) oo(-Z]8)] e010 


pour toute variable positive Y, (§,.) mesurable. 
2) Considérons la (P°, §,) martingale locale: 


t dB, 2 tds 
Lysexp|y -7 4st 
2 


0 Ps 


= (2) d’aprés (3.3). 
a 
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L’inégalité évidente: L,< (*:) , et Pinégalite de Doob dans L?(1<p<o) 
a 
entrainent que, pour tout pe]1,o[, et t>0: 


E [sup(L,)?]<oo. 
st 


En particulier, pour tout v>0, (L),. est une (P°, §,) martingale. 
3) Soit o une constante positive telle que: T<o. D’aprés 2), Q°=L, -(P°|%,) 
est une probabilité sur (Q,.,%,). D’aprés le théoréme de Géirsanov, 
tds , 
B,=B,—v| —(tSo) est un (Q°,(%,),<,) mouvement brownien réel. 


OFs 
On réécrit maintenant l’égalité (3.5) en: 


e 2v+1 ‘ds 
(3.5) p,=a+B,+ 2 = 


ce qui, d’aprés (4.2), montre que, sous Q°, le processus (p,),<, est un processus de 
Bessel d’indice v. 


On a donc, pour toute variable positive Y, (%,.) mesurable: 


a F 
p\" v* ds 
F° [y (°*) exp} —> J Sat |-en0res) 
SE°(YL,] 
= Eo, (Y)=E}(Y). 
(egalité (i) découle de ce que (L,),.9 est une (P°, %,) martingale). (+) est donc 
démontrée. 
4) Enfin, (4.6) est une conséquence de (*), appliquée d’abord avec l’indice 


H=0, puis avec lindice v>0. O 


Remarque importante. Dans son article [8] (voir aussi, Kazamaki et Sekiguchi 
[9]), Kazamaki donne un exemple montrant que son critére: 





| exp {4M,} sous-martingale | pour que exp{M,—4<M),} soit une martingale 





est meilleur que le critére, antérieur, de Novikov [13], a savoir: 





| pour tout t, E(exp}<M),)<« F 





Remarquons que le critére de Novikov ne s’applique pas non plus a la 
martingale (locale) L,=exp{M,—4<M),} qui figure dans la démonstration 
précédente, car: 


: 1 
(Mya {5 et E? (=)=% (s>0). 
OPs s 


Par contre, le critére de Kazamaki s’applique a Tévidence car: 


M a . me ; ; 
exp} = (*:) . Autrement dit, voici un autre exemple illustrant la puissance 
a 
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du test de Kazamaki (qui, soulignons le, est loin d’avoir été utilisé dans toute 
sa force. 0 


Les deux théorémes suivants constituent, 4 l’évidence, lorsque l'on prend 
u=0, une redémonstration probabiliste directe, respectivement du résultat de — 
Hartman et Watson (voir l’introduction), et des résultats (1.4) de Hartman. 


(4.7) Théoréme. Pour tout couple (u,v)eR, x R, et t>0, ona: 


I ‘ad 
y2 tds 7 (u? + v2) 1/2 t 
(4.8) Bs [ex - j [P= =p|= —_—__— 


2 op (22) 
i : 


Démonstration. Appliquons (4.6), avec le couple d’indices (, 4), ob 2 =(u? + v?)'/?, 
au temps T =r, et avec la variable: 


we t 
Y =fa)ex} . jest 


f désignant ici une fonction borélienne positive. II vient: 


Be [rover (—F 17) (2) | 


s 


-e:[r00() 


% 1 24 y2 
J dy f(y)—exp (_* y ) I, (~) y  (daprés (4.3)). 
0 . 2T Tt 


y2 t 
Désignons par G une version borélienne de E* [exp}—" z} ) i o- ° | Il vient 


alors, en appliquant 4 nouveau (4.3): 


oo ;' A _a@+y? ay 
J dys) G0) exp ( )1, ( )y 


2T Tt 


- 1 2+ y? Bt 
= { dyfiy)—exp (-S ales *) y, dot (4.8). O 

oO 
Remarque. A Yaide de la propriété de Markov, on peut écrire la version 
symétrisée suivante de (4.8): pour tout couple (y, v)eIR , x R, et 0<a<t<a: 


I (2-6) 
(u2 + v2)1/2 —2@ 


Fz [exp(—" 5553) |ee-ea]- 1 (et) 


*\t-—o 





Le corollaire suivant découle immédiatement de (3.7). 
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(4.7.1) Corollaire. Pour tout «eR, on a: 
a? tds 
(49) EE, (*-9/p, = p) = Ef (exp = ie p)= 
oP Ss 


Remarque. Au début de ce travail, la formule «intégrée»: 
pa 
I 
la] (2 t ) 
pa 
10(F) 
0 (* 
obtenue par Ito-McKean ([6], p. 271), nous a suggéré la formule (4.9). 
Rappelons que, a l'aide de cette formule «intégrée», Ito et McKean 
2(0,—9o) 
logt 
converge en loi vers la distribution de Cauchy de paramétre 1, lorsque (t— 0). 


Le corollaire suivant permet, en particulier, d’établir la jonction entre le 
théoréme (4.7) et ’étude menée par J. Pitman en [15]. 


E., (eia(@ na — J P.,(p,€d p) 


> 


déduisent trés rapidement le résultat de Spitzer [17]: sous P-_,, 


zo 


(4.7.2) Corollaire. Soit deIN, d=2, et m=5-1 m désigne un point fixé, non 
nul, de IR‘. 


(B,),>9 est un mouvement brownien, a valeurs dans RR‘, issu de 0. Alors, pour 
tout t>0, et velR, ona: 





2 © 
ce fexp— Sf tem amr] "eters 


l|um+B, ||? I,,,(r ||m||) 


Démonstration. On fait, dans lintégrale entre t et 00, le changement de variable 
= 1/v; puis, utilisant le fait que le processus (vB, ,,),. 9 est encore un mouvement 
brownien 4 valeurs dans IR‘, tendant vers 0, lorsque v->0, on applique la 


formule (4.8) avec w=p,, T= 1/t, p=, a=||m|. 


(4.7.3) Remarque. Si Yon fait, dans lintégrale figurant en (*), le changement de 
variable u=t+v, et que l’on utilise le fait que (B,,,,—B,, v2=0) est un mouve- 
ment brownien a valeurs dans IR‘, issu de 0, on peut énoncer, aprés introduc- 
tion d’une variable gaussienne G,,,,,,, 4 valeurs dans IR“, de moyenne (mz) et de 
matrice de covariance tIdps, indépendante du mouvement brownien d- 
dimensionnel (B,),, 9 issu de 0, la version abstraite suivante de (*): 


y2 2 du 
a ae G,. l= 
[ex ( 2 lea (me, ) p| 


y2 ie@) du 
a [exp (-! ETA) | Gaol =p| 


_ Tyga ml) 
1,4(0 Imll) 
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(4.10) Théoréme. Pour tout b>0, on note t,=inf{t20/p,=b}. Pour tout triplet 
(u,v, B)eR, x Rx R*, ona: 


ay by" 24,2 4,212(aV2 
(4.11) EX [ex} 5 14:60 |- (*y Qu2ev32(@V 2B) 
ots " L424 212(V' 2B) 





ou 2=K, si b<a; L=I, sia<b. 


Démonstration. Appliquons (4.6) pour le couple d’indices (, A), ob A=(p? + v?)'/?, 
2tpat 
au temps T=t, A t (t 20), et avec la variable Y=exp ‘f i} a B(t, A at 2(P 2, at)» 
0 Ps 


g désignant une fonction borélienne positive. II vient: 


»2 RAT Ts ee —u 
Et [=| -5 “Bites end Pass)" 


s 


(4.12) & 
=F? [exp(— Been )) g(P x, a0) (Pass) | 


a 


Il nous faut maintenant discuter selon les positions respectives de a et b. 


4 
1) Supposons b<a. Alors, p,,,,25, et donc: (Poss) < On peut donc 
faire tendre t vers +00 dans l’égalité (4.12)”, ce qui donne, avec g=1: 


() “e[o>-Tri eres] 


b\-4 K 2 - 
= (-) E} [exp(- pry| ee d’aprés (4.4), 


dou le résultat cherché. 
2) Supposons a<b. Alors, p,, ,, <b. 


De méme que précédemment, on peut faire tendre t vers+0 dans légalité 
(4.12), od lon prend g(p)=p*. On obtient finalement (4.11). 0 


Le corollaire suivant découle de (3.7). 
(4.10.1). Corollaire. Avec les mémes notations que dans le théoréme, on a: 
2, (aV2 
(4.13) E., [exp {iv(6,,-0)— B25} ] = 
2), (0V 2B) 
En faisant décroitre B vers 0, dans l’égalité (4.11), on obtient le 


(4.10.2) Corollaire. Pour tout couple (u, v)eR , x R, et 0<b<a<oo, ona: 


y2 tds b w+(u2 + v2)1/2 
E* [w}-5 St: <0 |= (2) re 


Il faut toutefois prendre garde au fait que, pour tout p>0, et 0<b<a, on a: P(t, =00)>0. Voir, 
a ce sujet, le corollaire (4.10.2). Par contre, si b2a>0, on a, pour tout p20, P*(t,=00 )=0 


2 
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b\?# 
En particulier, P*(t,<00) = (") ; 
a 
En conclusion de ce paragraphe, remarquons que, d’aprés l’égalité (4.11), 


considérée en p=O0, lapplication: (A,f)eR, x R* _, 2ynilaV 28) 
@ lb 2B) 
transformée de Laplace d'une mesure de probabilité sur R , x R.,, ce qui étend 
a la fois les résultats (1.4) de P. Hartman, et (4.4) de J. Kent. 
Enfin, ’énoncé suivant permet d’introduire les principales notations du 
paragraphe 5. C’est un corollaire du théoréme (4.7) (appliqué avec p=0), 
modulo un argument de continuité, et de (3.7). 


est la 


(4.14) Proposition. Pour tout r>0, il existe deux mesures de probabilité p, et n,, 
appelées respectivement distribution développée de von Mises (sur R) et distribution 
de Hartman-Watson (sur R ,) telles que: 


T(r) _ ivx 
i. ree du,(x), 


(4.16) pour tout A>0, =a =f f e~*“dn,(u). 
I,(r 0 


(4.15) pour tout velR, — 


De plus, on a: 


= —x?/2u 
(4.17) dp,(x)= (i dn,(u)e Tea) 


t 


et, sous IP_,, la loi conditionnelle (sur IR, x Rj) du couple (0,-09;15) quand 
oPs 
p.=p est égale a: 


1 
(4.18) e~*/24dn (u)dx, ou rae. 


V 2nu 


5. Etude des densités respectives de yu, et 1, 


Soit r>0, fixé dans tout ce paragraphe, dont l’objet est d’obtenir une expression 
aussi explicite que possible de p, et n,. 


On se sert de fagon intensive de la représentation intégrale suivante (cf. 
Petiau [14], p. 123) de I,(r) (r>0; AER): 


1 
— (rch —dw) J 
(5.1) I,(r) Fin ye , 


ou C,, (uelR) désigne le chemin indiqué par la 


Fig. 1 




















84 M. Yor 


(La fonction: w+e""°~*° étant entiére, ’'intégrale qui figure en (5.1) ne dépend 
évidemment pas de uw). 

En intégrant le long des deux horizontales, et de la verticale qui constituent 
Co, on obtient: 
sin (An) fenrem—a dr, 
0 


(5.2) L)=— f e” ©" cos (40) dd — 
0 


Il nous faut encore introduire, afin d’expliciter ,, la fonction ®,: R\{0}—R, 
définie par: 


oo 
“2 eo reht 
ni 


(5.3) ®,(x)= (xeR\ {0}), 


foo 


d 
(5.3), een > 


me: 
T 
On déduit immédiatement de (5.3), que #,(0 JES , ce qui permet de prolon- 


, , , : e 
ger ®, en une fonction continue 4 droite sur tout IR, en posant ©,(0)= 5 


assis a encore les mesures: 


m(d0)=5— | t-x,xi(9)d0; x,(dt)=e"™ rsht dt (x, est une mesure positive, définie 


tdx 


n(t? +x?) 


sur R.,, de masse totale e~"), et enfin c,(dx)= 
Cauchy de paramétre t. 
On peut maintenant énoncer le: 


(t>0) la mesure de 


(5.4) Théoréme. Soit r>0. 
(i) u, se décompose en la somme p,+q,, ou: 


(5.5) p,(d0)= e791. (9)d0 est la distribution de Von Mises de 
2nI,(r) : 


parametre r. 


(5.6) q,(d0)= i j \ —e-"m+m* j 7,(dt) a 


(5.6’) =—5 a —___ {@ (0—n)—6,(0 + 2)} d0 


est une mesure bornée, de masse totale nulle. 


De plus, si Con note f, la densité de ,, définie par: 


1 
(5.7) S(D= In I aa 1 x29) +57 ; o(”) 


la fonction f: (R.,\{0})xR—R,, est continue. 
(r, 0) f,(9) 


{®,(0—12)—9,(0+7)}, 
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(ii) n,(du)= 


1 
n2/2u 
Qui? e iA os yr (u)du, avec 


= t —x?/2u ,—rchx . mx 
(5.8) W,(u)= ! dxe e (shx) sin ( 


La fonction ainsi définie sur (IR, \{0})* est continue. 


Démonstration. (i) Remplagons, en (5.2), 4 par |v| (velR). On a: 


| | sat : 
— f e”©°** cos(v0)d0 =— fe”? <tts, e”>d0. 
To To 2 


n 


L’application: ee ry e”°°°* cos(v0)d0 apparait donc comme la transformée 
0 0 


de Fourier de la probabilité p, donnée par (5.5). 
D’autre part, écrivons, pour v+0: 


sin (|v| 7) 


_ Sin (jv| 2) fer e- bil gp = [Re] (—1) j en reht { 9— lv vit Hyf} dt 


Tu 0 
expression qui se transforme, par intégration par parties, en: 


sin (v7) 
vr 


sin (vz) 


La fonction: v> : (prolongée par 1 en v=0), resp: v>e~'!'(t>0) est la 
T 


[-e+ j n,(atye-""|. 


transformée de Fourier de m, resp: c,. 


L’expression obtenue précédemment est donc, comme fonction de v, la 
transformée de Fourier de 


A=—e-'m+ms« { 2,(dt)c,, 
0 


d’ou la formule (5.6). 
La mesure A a pour densité: 
cd 0 _ ~f d j ne 
- ae In, +nt(9) sn n[t?+(0—x)?] 


Lorsque l’on fait le changement de variable t=|0—x|y (en t) dans lintégrale 
double, celle-ci se transforme en: 


= ~rch{|@— xy] ee! Ase 
L()= if dx { dye rsh[|0@—xly] ore 


ones dx sgn(0—x)e~"*@-)»l rsh [(@—x) y] y. 
0 m(1+y") *. 
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Discutons maintenant les cas: |6|>z, et |O0|<7; 





a)| 0>nx 











«o 


dy 
L(@)= —rch[(0—n)y] __ p—reh{(0+x)y] 
Om | ary © j 


=@(0—n)—©®,(0+7), daprés la formule (5.3), . 





0<-—nx| : Pour tout xe]—7z, +2[, sgn(@—x)=—1. 





formule (5.3), assure encore que L(@)=,(0—1)—©®,(0+7). 








Ol<n |: 
ats dx e~"@-Irsh[(@—x) y]y 


— i dx e~"hlO-¥l poh [(0-x)v1y| 
6 


t dy -r —rch[{(0+ 2) y] —rch[(@—2)y] -r 
j ——- [e’-e le Nt e-"} 


~ 9 a1 +y?) 
=e '+0,(0—2)—®,(0+7), 


dot la formule (5.6’) en toute généralité. 

La continuité de f, définie par (5.7), se vérifie ais¢ement (les seuls points de R 
ou la continuité en @ n’est pas évidente a priori sont —2 et +7, mais le choix 
que nous avons fait pour ®,(0) entraine le résultat cherché). 

(ii) Rappelons la formule: pour tout x >0, et A420, 


1 


—(V24)x t 
(5.10) e “ e~ "Onudyi xe 


Sidiiee! 
Cette formule est classique: elle exprime la transformée de Laplace du temps 
d’arrét T,=inf {t/B,>x}, ot (B,),> 9 désigne le mouvement brownien réel issu de 


D’autre part, pour tout e>0, sur l’ouvert 0,={z=x+iy/x?—y?>e,x>8}, 


l'application: z—> f e~™ Qausyia2° ~=7/24 du est holomorphe, et l’égalité (5.10) est 
0 


donc encore vraie lorsque l’on remplace x par z€0,. On peut donc écrire, d’aprés 
(5.1): 


higubiieas f doe [em au —____ me!" du 


2nu*)!! 


1 rch oy e~27/2u | 
Ee © Sail dw}, 


2in 


application du théoréme de Fubini étant licite. 
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La mesure y,(du) est donc absolument continue par rapport a la mesure de 
1 


1 
T(r) 2nu3)'? 


Lebesgue, et admet pour densité: H(u), ou 


1 
H(u ee j dwe™ w e707 /2u . f dave?" we 07/2", 
2in 2ix 
Cate Co 


la fonction intégrée étant entiére. 
En intégrant sur les deux horizontales et la verticale qui constituent Co, il 


vient: 

1° _(x+in)? _ (x—in)? 
HW=5,, | due" ie +iaje 2u —(x—in)e 2 | 
Te 


00 1 _— 
ete fdxe-"™e i acy [- 2x sin (= )+2nc0s (=) 
27 5 u 


ru 2, ar “3 ;. $Me 
=— er'/2m { dx e~**/2" e~"*(shx) sin (= ; 
Tt > u 


par intégration par parties. On a finalement obtenu la partie (ii) du théoréme, la 
continuité de y sur (IR, \{0})* étant évidente. O 


Remarque importante. On peut également démontrer la partie (i) du théoréme en 
remarquant, a l’aide de la formule (5.1), écrite en u>0, que I (el (dv). On en 
déduit, par inversion de la transformation de Fourier, que: 


f.(Q=——-— oe eal > . 2 ei” e-vlody 


do, 
(5.11) - Z i we 
2inI,(r) é, 


rk. AR 
1(07 +?) 


(Il découle de cette formule que f est indéfiniment différentiable sur 
(R,,\{0}) xR). 

Si |6|>z, on peut prendre u=0, puis constater, aprés quelques calculs, que 
lon a obtenu la formule (5.7). 

Si |09|<z, on remplace C, par le domaine d’intégration indiqué par la 


iR 
im 
id 








-i6 








ce qui, 4 nouveau, permet de retrouver (5.7). 

Nous préférons 4 cette méthode la premiére démonstration que nous avons 
donnée de (5.7), car elle fait apparaitre diverses probabilités (cf.: la formule (5.6)) 
qui ont peut-étre une interprétation dans l’étude du lacet brownien. (Voir déja le 
corollaire suivant). 
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Notons encore une formule analogue a (5.11) pour la densité g,(t) de dy,(t) 
par rapport 4 dt (sur R,), formule qui a été dégagée au cours de la 
démonstration précédente: 


er 1 rcho a agial 
(S.21) &O- FTW 6 1° a 


Ajoutons enfin quelques remarques complémentaires sur la _ fonction: 
x>0-— @,(x) 


a) il est immédiat, a partir de (5.3), , que: 


1 n/2 
(5.12) #,(x)=— J e~rehtxtey) dy (x>0). 

0 
b) Il est plus important de remarquer que la transformée de Fourier de: 


x 
t—+e-" est: E> Kyz(r), et celle de: t——- (x >0) est: Ee !4!*, 
> f— Kia(r) at? + x) ) g 


On obtient donc, en conséquence du théoréme de Plancherel?, et de (5.3): 


1 @ 
(5.13) ®,(x)=— | Kya (rye dé 
0 


c) La formule (5.13) permet d’obtenir, grace 4 la méthode de Laplace, 
Péquivalence suivante, lorsque xf +00: 


‘ 1 acess : 
si u<v, ©(x+u)—O(x+v)~—K,(r)(v—u)x~?; en particulier, si 
v=—u=7, on a: - 


(5.14) @(x—n)—O(x+n) ~ 2Ko(r)x~?. 


|x|— 0 


Remarquons, pour étre complet, que lon a immédiatement, a partir de (5.13): 


1 
®,(x)=— Ko(r)x" * (|x| 00). 


6. Loi de indice du lacet brownien 


Le lemme (6.1) permettra de ramener le calcul de la loi conditionnelle de 
(0,—0), quand Z,=z, sous P_,, a celle de (0,—0,), quand p,=p (cf., corollaire 
(4.7.1). 

On emploie indiffeéremment la notation complexe z=x+iy, ou bi- 
dimensionnelle (x, y); si zeC\ {0}, 0, désigne la détermination de l’'argument de z 
appartenant a [—7z, +7[; si z,z’eC, on note ¢z,z’) =xx'+ yy’ le produit scalaire 
usuel de z et z’. 


Je remercie M. Barrucand qui m’a fait cette remarque 
On obtient une représentation du méme type pour la fonction y, (cf., formule (5.8)). Les calculs 
correspondants - inutiles ici - seront détaillés dans une Note séparée. 
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Soulignons que les hypothéses faites dans l’énoncé qui suit ne sont pas 
minimales, mais elles suffisent largement a nos besoins. 


(6.1) Lemme. Soient R et @ deux v.a. définies sur le méme espace de probabilité, 
respectivement a valeurs dans R_,, et R. 


R,xR-R 
Soient U: R, >R,, et V: * deux fonctions boréliennes telles 
(p,u)— V,(u) ' 
que: 


dPa(p)=dppU(p) et dPyp_,(0)=d0V, (8). 


Alors, si 'on note Z=Re®, on a: 


(i) dP,(x,y)=dxdyp(z), ot p(z)= >) U(\z|) V,,(0,+2nn) (z+0). 
neZ 


i U(\z|) Y.(0, +2 
Gi) Py. Yo 
neZ p(z) 
(iii) Si, de plus, les applications U, V et p (supposée identiquement finie, et 
strictement positive) sont continues, respectivement sur IR* ,IR* xR, et C\ {0}, 


alors, pour toute fonction continue, bornée et a support compact ¢: RR, 
application 





E0. +2nn)* 





Og: z— y U ((z|) V,.\(0, + 2nm) 


o(0,+2nn) 
neZ p(z) 


est continue sur C\ {0}. 


Démonstration. (i) et (ii) ahoowlent simplement de la décomposition de l’intégrale 
2nn+n 


(générique!) en d0, soit t do[..Jen } f dO{[...], et du changement de 


néeZ 2nn—n 


variable 0=2nn+u dans Tintégrale correspondante. 
Pour démontrer (iii), on commence par remarquer que l’on peut remplacer - 
a lévidence - dans la définition des fonctions p et LJ, la détermination 
principale 0, de l’'argument de z¢C\ {0} par n’importe quelle détermination. 
L’assertion (iii) est alors une conséquence immédiate de lexistence, au 
voisinage de tout point ueC\ {0}, d’une détermination continue de l’'argument. 


Remarque. Si U et V sont continues, respectivement sur IR*, et R* xR, et si 
P,(dx x dy) admet une densité p, continue (finie) par rapport a dxdy, alors la 
fonction p du lemme (6.1) est identiquement finie sur C\ {0} (et égale dxdy p.s. 
a Po). 


Rappelons maintenant que, selon les notations du paragraphe 2, IPX (zz) +0) 
désigne la loi du pont brownien complexe PB(zo,z) (entre 0 et t>0); on avait 
pris, pour simplifier, t=1 dans le paragraphe 2. 


2,2 
(6.2) Théoréme. Soit z,+0. On note p= 2170 et A0,1)=9.,.51- Pour toute 
T 
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fonction h: R— R borélienne, bornée, on a: 


<21,Z0> 


(6.3) IEE (h(0,—0))=e —* {2a o()} ¥ Fo, 1) +2nR) (Oo, 1, +207)! 
neZ 


_ <21,20) 
(6.4) =h(Oo, 1) +e " : [P,(0.0, 1) +(2n—-1) 2)— (86, 1) 


neZ 


+(2n+1)2)] h(O,o, ,).+2n7) 


, : z 
Démonstration. Posons Mo= ET Z,=ZUg, et a=|z,|. Avec ces nouvelles nota- 
Zo 


tions, l’égalité (6.3) se transforme en: 


(6.3’) IE (h(0,—0,)) = y 2nlo(rde * f,(0, +2nn)h(0, +2nz). 
Il suffit, pour démontrer (6.3’), d’appliquer le lemme (6.1) avec Z=Z,up' et 
#=0.—0,). Remarquons que R= ,, et que, avec les notations du lemme 

a) U ate a e (0°) (d’aprés (4.3), ou par un calcul direct). 
b) VO=L0. 


c) p(z)=5_e ? 


|z|?+a2 ax 
[ Tt 5 Fy 


(Cette identité découie, de fagon simple, du lemme (6.1), de la remarque qui le 
suit, et de la formule (5.7) explicitant f.) 


Les hypothéses du lemme (6.1) (partie (iii)) sont bien vérifiées grace au 
théoréme (5.4). Ainsi, si h est une fonction continue, bornée, 4 support compact, 
les deux membres de (6.3’) sont des fonctions continues de z(eC\ {0}) (le membre 
de gauche lest d’aprés le lemme (2.12)), qui sont égales dz ps. et donc 
identiques. L’égalité (6.4) découle de (6.3), et de la formule (5.7). 0 


Le résultat suivant est une conséquence immédiate de (5.14). 


(6.2.1) Corollaire. Avec les notations du théoréme (6.2), on a: 


R K (r) _ 21,20) 
IP?! (0, — 09 =o, y+ 2ne) = Fag! mn. sos 


Le théoréme ci-dessous étend les résultats du théoréme (6.2). 


(6.5) Théoréme. Avec les notations du théoréme précédent, on a, pour tout z, +0, 
et toute fonction H: R x R., > (, borélienne, bornée: 


t 6 
IE: (1 (0,04; 5) 

OPPs 
(6.6) és, _ (2nx+%o, 1)? 


a Zo> © yp. 
=Irje fan VC eG o,1)+2n03u)] 
0 


neZ 


> ¢f., la formule (5.7) ov est définie la fonction f 


Rappelons que la mesure n, a été définie au moyen de la formule (4.16), et explicitée dans le 
théoréme (5.4), ii) 


6 
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Plus particuliérement, pour toute fonction K: R., >, borélienne, bornée, et tout 
AeR, on a: 


IE: (e28-00 K (; “)) 
0 p: 


«21, AL a a _ eae Mavi 
=I,(r)e J dno e = 1) Kw) 


neZ 


(6.7) 


et enfin, on a, pour tous A, veR: 
—v2/2§ ds/p? 
Z1(pidA(Oc— 90) . 
IEZ: (e e 0 
_ (21520) 


~ —in@o, 
‘i S y e~ indo ” Tens ay24 v2]! 2(r) 
neZ 


(6.8) 


Démonstration. 1) La démonstration de la formule (6.6) est tout a fait semblable 
a celle de (6.3), la seule difference consistant 4 faire usage maintenant de la 
formule (4.18). 

2) La formule (6.7) découle de (6.6), ot l'on prend H(0,u)=e'’K(u), via 
Pégalité (on note =e ici 8 pour A 5)): 


_ (2nn+ 0)? 


tb) e i ef A0+ 2nzy) — 3 ooh 


neZ neZ 


2 
(n+ A)?u nid | 
’ 


elle méme conséquence immeédiate de la «formule de Poisson»: (cf, par exemple, 
[18], théoréme 5, p. 153) 


—n(n+z)2a __ —nn2/a+ 2innz 
ee 


neZ 7 neZ 


pour tout a>0, et tout zeC. 
3) Enfin, remarquons que: 


(n+ A)?u 


fdn(w[Ye 2 J<o, 
0 neZ 


d’aprés la formule (6.7), prise en 0,9, ,,=0. On a donc, d’aprés la méme formule: 


—v?/2 § ds/p2 
0 


ES (ei4(-— 9) e 


<21,Z0> 
=I,(r)e x >. enti as “Of 
=!o 


neZ T(r) 
d’aprés la définition de n, (formule (4.16)). 0 


Avant d’étudier le cas particuliérement intéressant ot z,=z,, donnons une 
autre version de certains des résultats du théoréme (6.5): soit d=>2, et m,m’ deux 
points non nuls de R‘; si (B,), > est un mouvement brownien a valeurs dans R‘, 

. du 


issu de 0, la distribution 2gn.my de | 


|m+m'u+B,|2 est symétrique en met m’, 
0 u u 
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C'est A dire: Ap, m’)=Aim’.my> C& Qui se démontre, comme pour le corollaire (4.7.2), 
en utilisant le fait que (vB, ,,),,, 9 est encore un mouvement brownien. Pour d=2, 


ae , a rs d 
la méme propriété, appliquée au calcul de la loi conditionnelle de | ~ 


; |\mu+B,\? 
quand m+ B, =m’, nous donne, d’aprés la formule (6.7): pour toute fonction K: 
R., —C, borélienne, bornée, 


J K(%)d dm. 9( 
(6.9) 


nu 


=1,(r)e~<™™ i dn we! 5} +i. a}, K(u) 


ou r=||m|| |lm’||, et Ao. 1) =n m-1- 
Les théorémes (6.2) et (6.5) permettent en particulier de calculer la loi du 
. e 1 sales 
couple (, j =), sous Pt —P®, ou l'on note N,=5—(0,—9o) Pindice de 0 par 
o Ps 2 
rapport au lacet brownien (Z,(w),0<uSt). On note dorénavant patel 
(6.10) Theoréme. 1) Sous IP*, la loi de Pindice (N,) est: 


(6.4*) Eo(dm)+e-" ¥ [@,((2n—1) 2)—©,((2n +1) )] e,(dm) 
neZ 


t 
2) Sous IPS,, la loi du couple (x, iS) 
0 Ps 


est: 2n? x2 


(66%) Io(her"dn.w|/ LY e+ eld} 


Pour conclure, soulignons que la fonction caractéristique de N_, sous IP*, a la 
forme particuli¢rement simple suivante: 


(6.8*) Es (e7***)=e" SY Tin, 


neZ 


7. Extension de certains résultats en dimension supérieure a 2 


Dans tout ce paragraphe, d désigne un entier supérieur 4 2, et pour tout meIR‘, 
P,, est la loi - définie, comme d’habitude, sur l'espace canonique C(IR , , R*) - 
du mouvement brownien a valeurs dans IR‘, issu de m en t=0. 


t>0 étant fixé, le but du paragraphe est de donner, de fagon aussi explicite 
que possible, la loi de { ds/\|B,\?, sous IP,,, conditionnellement 4 (B,=m) 
0 
(m,m’ +0; on note encore IP” la loi du pont brownien ainsi défini entre 0 et 1). 
Rappelons, en préliminaire, les résultats classiques suivants: 


(7.1) Le processus (B,) peut se représenter comme «skew-product» de sa partie 


radiale p,=||B,\|, et de sa partie «angulaire» o,= “ia :)* valeurs dans la sphére 
s/p; 
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unité S,_,, le processus (0,),.9 étant un mouvement brownien standard a 
valeurs dans S,_,, indépendant de p (cf. Ito-Mc Kean [6], p. 270) 

(7.2) La densité, au temps t, du processus (9,),.9 4 valeurs dans S,_,, issu de 0 
en t=0, par rapport a la mesure de surface do() sur S,_, est donnée par: 


© mateo 3ht 2)t 
H0.8)= ¥ € ME” pd; <0,0>) 
n=0 4 
(cf, Ito-Mc Kean [6], p. 275, ou F. Perrin [19], formule (59), p. 25), ou: 
N(d;n) est le nombre d’harmoniques sphériques linéairement indépendantes, 
de degré n, en dimension d. 
w, est la surface de la sphére unité de R‘. 
P,(d;.) est le polynome de Legendre de degré n, en dimension d. 
<9, p> est le produit scalaire des points 0 et ¢ de S,_ , CIR’. 
On voit aisément, en conséquence de légalité: P(d;0)=1, que la série 
définissant p“ est majorée, terme a terme, par: 


—mntee 2)t N(d; n) 


Wg 


e 


et est donc uniformément absolument convergente. On peut maintenant énoncer 
le: 


d 
(7.3) Théoréme. Notons Ma=5— 1. Soient m,m’eR‘, non nuls; 0=m/\\m\\; 


m||_ ||m’ ; ca 
0 =m'/\|m'||. On note encore pall tel Alors, pour toute fonction borélienne 
t 


¢@:R,—-R,, ona 


t <m,m’) 
(7.4) IE” lo (j >) eal “anyi2 tol”) tel rt fener Hae? b(v) p (0; 0’) 


OPs 


et, en particulier, pour tout veR: 


, -v?/2§ ds/p? 
EF, o> ) 


_ <m,m‘) (2n)4/2 


(7.5) 
a : rita LS Y Fuse na?s pal) —— 


ME") pa; 0,0)| 


Démonstration. La formule (7.5) découle immédiatement de (7.4), ot l'on prend 
o(v)=e-””?, de (7.2), et enfin de la définition de y, par sa transformée de 
Laplace (formule (4.16)). 

Afin de démontrer la formule (7.4), supposons @ bornée, et introduisons 
f:IR‘—>R, borélienne, bornée. 


On a alors, d’aprés (7.1), en notant V, =f ds/p?: 
0 


E,, L/(B) @V)] 
=|P,,(p,edp; V.edv)(v) | do(d) p® (0; >) f(po) 


Sa-1 


=JP,, (.€dp) J dripimy (0) (0) | me do(d) p (0; b)f(o9), 
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ou, d’aprés la formule (4.8), n@ (dv) désigne la probabilité sur R,, dont la 


I 12(r 
transformée de Laplace, au point (") est égale a: Tugel) On a, a 


27 nal") 
mC : dn,(v) e~ "2", 


On termine la démonstration en explicitant la densité de P,,(p,¢dp) par 
rapport a dp, puis en reprenant les coordonnées cartésiennes 4 la place du 
couple (p,¢). O 


Si on prend ¢=1 dans la formule (7.4), on obtient le: 


Pévidence: dy (v)= 


(7.3.1) Corollaire (Hartman-Watson, [4], p. 594, identité (1.5)). Avec les mémes 
notations que dans le théoréme, on a: 


<m,m' > 


rea 
: Qn? Io(r )f dngto)e wal? 190; 0). 


Soulignons ici que, d’aprés les arguments qui précédent l’établissement de la 
formule (6.9), on a, pour m,m’ +0, et (B,) mouvement brownien a4 valeurs dans 
R‘, issu de 0: 


2) are 
E(e. |\m+m'u+ B, || )= IE" (e 


1 
—v?/2 f ds/p? 
0 


d’ou Pobtention de la mesure /,,,,,,°) sur IR, , pour toutes les dimensions d2 2. 


(7.6) Remarque. La formule (7.4) est, bien entendu, également valable pour d=2, 
et l'on retrouve alors (partiellement) les formules (6.6) et (6.7), respectivement 
pour H ne dépendant pas de 6, et 4=0. Le passage de (6.6) a (6.7) résulte 
d ailleurs simplement de la représentation du mouvement brownien standard sur 
S, comme l'image du mouvement brownien réel par la congruence modulo 272. 
Cette remarque a déja été faite par F. Perrin ([19], p. 22). En dimension d>2, le 
méme auteur donne (formules (61), p. 26) une expression de p(0; 0’) différente 
de (7.2), mais qu’il semble difficile d’interpréter du point de vue probabiliste. 


8. Remarques finales 


a) J. Pitman étudie, en [15], certaines propriétés de la norme du mouvement 
brownien a valeurs dans IR‘, avec drift constant. 

Aprés lecture d’une version préliminaire de notre travail, il a déduit la 
formule (*) (corollaire (4.7.2)) pour d=2, du théoréme (4.7). 

La démonstration du corollaire (4.7.2) présentée ici est postérieure a celle de 
Pitman, mais nous semble plus immédiate. 

b) Egalement aprés la lecture de ce papier, J. Henry nous a appris que la loi 
de lindice du lacet Brownien avait déja été calculée par S.F. Edwards [20] 
par une méthode tout a fait différente de la ndétre. 

c) La densité de la distribution de von Mises développée (Théoréme (5.4)) y, a 
été obtenue initialement par D.G. Kendall, et figure (p. 212-219) dans la thése de 
Ph.D. de J. Kent, intitulée: « Distributions, processes and statistics on spheres» 


(University of Cambridge - Novembre 1976) 
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Toutefois, 4 la connaissance de Kent, ce résultat ne semble pas avoir été publié 
jusqu’a présent. Nous n’avons pas trouvé trace non plus, dans la littérature, du 
calcul de la densité de n, (théoréme (5.4) également). 

Je profite de ces remarques pour remercier vivement J. Pitman, J. Henry et J. 
Kent de lintérét qu’ils ont porté a la réalisation de ce travail. 
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Nonparametric Estimation Based 
on Censored Observations of a Markov Renewal Process 


R.D. Gill 


Mathematical Centre, Kruislaan 413, Amsterdam, The Netherlands 


Summary. Uniform consistency and weak convergence is proved of esti- 
mators of the transition probabilities of an arbitrary finite state space 
Markov renewal process, based on n independent and identically distribut- 
ed “right censored” realizations of the process. The approach uses the 
theory of stochastic integrals and counting processes. It is shown how the 
results may be extended to the non-identically distributed case and to 
general censorship under suitable conditions. 


1. Introduction 


A Markov renewal process can often be used to describe the lengths of time 
spent in consecutive stages (not necessarily following a fixed order) of a 
disease, or in the functioning of a machine. In such a situation one might want 
to estimate the distribution functions (so called transition probabilities) which 
describe the probabilistic behaviour of this phenomenon. Lagakos et al. [9] 
(see this paper and its references for applications) proposed certain estimators 
on the basis of maximum likelihood considerations by maximizing the proba- 
bility of n realizations of the process observed on fixed finite time intervals 
over all discrete transition probabilities; they also derived approximate va- 
riances and covariances of the resulting estimators by looking at the Fisher 
information, again assuming discrete distributions. The present paper gives 
rigorous derivations of consistency and weak convergence of these estimators 
as n, the number of (partial) realizations of the process, tends to infinity. We 
make no assumptions on the transition probabilities and also allow quite 
general types of censoring: that is to say, the mechanism through which only 
incomplete realizations of the Markov renewal process are available. 

Let Jo, J;,... be the consecutive states of a Markov renewal process and let 
X ,,X,... be the sojourn times in these states (we take our notation from Pyke 
[13] apart from two changes which we mention shortly). So Jo, J,,... are r.v.’s 
taking values in the finite set of states {1,...,m} for some meN and X,, X,,... 
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are r.v.’s taking values in (0, 00] with the interpretation that the process starts 
off at time S,=0 in state Jo, and at time S,= y X,, after a stay of length X, in 
state J,_,, jumps to state J, (at least, if S, avn The joint distribution of Jo, 
X,,J;, X 2, Jz, ... is built up recursively from an initial state distribution 
%=P(Jo=k) ksm (1) 
and transition probabilities (which we want to estimate) 
0, (t}\= P(X, St,J,=j|J,_,=i),  i,jgm, te[0,0], neN, (2) 
satisfying Q;,(0)=0 by supposing that for each neIN 
P(X, St, J,= flo X tI -s Xp sq P(X Sb = il.) a8. 3) 


We shall also be interested in estimating the (possibly defective) distribution 
functions H,, defined by 


H,(t)=P(X,StlJ,_,=)=)/0,,(0. (4) 


Rather than working with the r.v.’s Jo, X,,J,,... we step over to counting 
processes N,; (Aalen [1]) which register the transitions made from state i to 
state j up to time t: 


N,(t)= #{n21:S,St,J,_,=i,J,=j} — i,jsm, te[0, 0). (5) 


Because the state space is finite, by Pyke [12, Lemma 4.1] these processes 
have sample paths which (almost surely) are finite for all t, zero at time zero, 
integer valued and right continuous with jumps of size +1 only, these jumps 
furthermore not occurring simultaneously in different processes. Define also 


N,.()=YNij(0) 


N (O=YNjO (denoted N,(t) in Pyke [12]) 


N()=PNj(0 (denoted N(t) in Pyke [12]) 


and define 
Z(t)=Jirw 


L(t)=t—Sia_y: (7) 


Pyke [12] calls the process Z, the state occupied at time t+, a semi-Markov 
process, and the multivariate process {N jj Sm} a Markov renewal process. 
L(t) is a left continuous version of what is called the backward recurrence time ; 
it is the length of time which at time t— has elapsed since the last jump of the 
Markov renewal process. It plays an important role in the sequel. 

We shall later use the fact that N(t) has finite moments of all orders. For, 
as m is finite, there exists a distribution function F such that F(0)=0 and 
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Q;(t)S F(t) for all i, j and t. So N(t) is stochastically dominated by the 
evaluation at time t of a renewal process having recurrence time distribution 
F; and for this process the result is well known. 

The rest of this paper is organized as follows. We shall conclude this 
section with some more notation and some facts from the theory of stochastic 
integrals and counting processes for use in Sect. 2. There we formally define 
our censoring model, by introducing a process K taking the values 0 and 1, 
with the interpretation that the Markov renewal process is “under obser- 
vation” at time t if and only if K(t)=1. We assume that K is predictable which 
loosely speaking means that K(t) does not depend on the development of the 
Markov renewal process in [t, 00). It is easy to construct counter examples 
where K does not have this property and the estimators considered here are 
inconsistent. Some further technical assumptions are made, such as a restric- 
tion to “right censorship”. Then we derive some important equalities involving 
the first and second moments of processes counting sojourn times in the 
various states “observed” to be <t and to be followed by a transition to a 
particular state, or just “observed” to be 2t. In Sect. 3 we give definitions of 
the estimators of Lagakos et al. [9] based on n independent and identically 
distributed censored observations of the Markov renewal process. The esti- 
mators depend on the data through the processes counting observed sojourn 
times. We now prove three theorems: on consistency of the estimators with 
arbitrary Q;/s, on asymptotic normality with discrete Q;;'s, and on weak 
convergence with arbitrary Q;/s. Theorem 2 with its easy proof (once Theo- 
rem 1 has been established) is included to illustrate Theorem 3, without the 
latter’s technicalities (which are themselves largely postponed to Sect. 5 in the 
form of some lemmas). Section 4 discusses these results and shows how the 
assumptions of right censorship and identical distributions may be relaxed. 


Notation. For a real valued stochastic process Y={Y(t);te[0,0o)} whose 
sample paths have left hand limits, Y_ is the process defined by 


Y_(0)=0 
and 
Y_(t)=Y(t—). 


In dealing with an indexed family of processes, we write e.g. Y,_ for (Y,)_. We 
define Y, similarly under the obvious conditions. AY is the process Y— Y_ and 
&Y the function t>&Y(t). By [XdY we denote (with a single exception in 
Theorem 3) the process with sample paths t> f X(s)dY(s), the latter being 


(0,1) 

interpreted as a pathwise Lebesgue-Stieltjes integral, such an interpretation 
being possible in all the cases considered. We write {~ X dY for the process 
(|XdY)_ with sample paths t> { X(s)dY(s). Some miscellaneous notations 


[0,t) 
are x, for the indicator function of the set A and ||-||, for the supremum norm 
on [0,7], also written ||-|| if there can be no confusion. Unless otherwise stated, 
time variables s, t, t, etc. are always in [0,00) and state variables i,j in 
{1,...,m}. The o-algebra generated by a family of r.v.’s is denoted by ¢{-}, 
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while to indicate that generated by a union of o-algebras we use the symbol v. 
Maximum and minimum are denoted by v and ~ respectively. 


Stochastic Integrals. We collect together here a few results from Meyer [11]. 
Let (Q,F,P) be a complete probability space and let {F,;te[0,0)} be an 
increasing right continuous family of sub o-algebras of Q such that A contains 
all P-null sets of &% A process X is predictable if as a function of (t,@) it is 
measurable with respect to the o-algebra on [0, 00) x Q generated by the left 
continuous adapted processes. A process X possesses a property locally if there 
exists a so called localizing sequence of stopping times {T,}, T,,f 00, such that 
for each n the process t>7;7, , 9, X(t A T,) has this property. 

Martingales are always supposed to have right continuous paths with left 
hand limits. Let M and N be local square integrable martingales and let H and 
K be predictable and locally bounded. There exists a unique predictable 
process <M,N) with paths of locally integrable variation such that MN 
—<M,N) is a local martingale, zero at time zero. A process Ho M is uniquely 
defined by requiring that 

(i) it should be a local square integrable martingale, and 

(ii) <Ho M, Ho M)={H?d<M,M). 

Almost everywhere where the path-wise Lebesgue-Stieltjes integral [ H dM 
is well defined, the paths of HoM and fHdM coincide. Furthermore 
<HoM,KeN)>=|HKd<M,N). We actually make most use of the simple 
corollaries of these results, that if the processes [Hd M and |K4dN exist, if all 
the localizing stopping times above can be taken as constants, and if M(0) 
= N(0)=0, then the following equalities (between functions on [0, 00)) hold: 


&({HdM)=6(j/KdN)=0, (8) 
&({HdMjKdN)=&({/HKd<M,N)). (9) 


If the word “local” can be dropped above, then the same relationships hold on 
[0, oo]. 


Counting Processes. Let {N,;i=1,...,k} be an indexed family of processes with 
right continuous paths, which are zero at time zero, nondecreasing and integer 
valued, and have jumps of size +1 only, no two processes jumping at the same 
time. Suppose that for all t, 


F,=FA,v o{N,(s); i=1,...,k, st}. 


Define T,=0 and T,=inf{t: ) N,(t)2n}, n=1,2,..., ie. T, is the time of the 


n-th jump of {N,; i=1,...,k}. For n21 let J, be the index of the particular 
process which if T, is finite jumps at time T,, i.e. N, (T,)=N,,(T,—)+1 if 
T,< 00. Define F,,(t) to be a regular version of P(T,<t and J,=i|F;,_,), 
te[0, 00), i=1,...,k, and n=1,2,..., and 


«o t dF. 
A= > em (10) 
191_ Y E,(s-) 


i’=1 
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Jacod [8] shows, for i=1,...,k, that M;=N,— A; is a local martingale, zero at 
time zero, while by Gill [6, Propositions 12 and 13], or Liptser and Shiryaev 
[10, Lemma 18.12], the M;’s are local square integrable martingales with 


J(l-4A)dA,  i=j 
—J4A,dA, i+j. a) 


(MyM >=} 


The localizing stopping times may be taken as T,, but if &N,(t)<0o for all i 
and t also as constants. 


2. Random Right Censorship of a Markov Renewal Process 


We suppose that it is not possible to observe complete realizations of Jo, X ,, 
J,,... but rather that observations consist of the values taken by the processes 
Z_,Land A N,; (i,j $m) on some random subset of the time axis. Letting K be 
the indicator process of this set, then we say that the (Markov renewal) process 
is under observation at time t if K(t)=1, when the state at t—, how long it has 
been occupied, and a possible transition at that time instant and the new state 
can be observed; otherwise K(t)=0. Let (Q,4 P) be a complete probability 
space on which the process K and the r.v.’s Jo, X,, J,,... are defined. Then the 
following assumptions about K are made, and with the exception of A3* are 
taken to be in force throughout the rest of the paper, unless explicitly stated 
otherwise: 
Al. There exist r.v.’s T, such that almost surely, 


S,ST,SS,,,Vn and K(th=)x%5,7,,(Vt. 


A2. There exists a sub o-algebra ./ of F containing all P-null sets of F, 
conditionally independent of o {N,,(s); i, jm, se[0, c0)} given Jo, and such that 
for each nT, (see A1) is a stopping time with respect to the family of o-algebras 
{F,; te[0, 00)} defined by 


F,= A O{Io, N,(s); i,j $m, s€[0, t]}. 


A3. &(+#{n: T, >S,})< 0. 

A3*. &((+#{n: T,>S,})’*")< for some e>0. 

The first assumption limits us to what might be called “right-censorship”. 
A2 implies that the process K is predictable and might be interpreted as 
stating that K(t) does not depend on the development of the Markov renewal 
process in [t, 00) given the past at time t. According to A3 or A3* the number 
of at least partially observed sojourn times is certainly almost surely finite. A3 
is sufficient for proving consistency of our estimators, and weak convergence 
when the X;’s take on values in N; for general Q,, we need A3* in proving 
weak convergence. It ought to be possible to weaken A 3* by using perhaps a 
different method of proof: however in practical applications A3 and A3* will 
generally hold. For suppose that for some fixed finite k and t observation 
always stops after at most k transitions after the time instant f; Le. s>Sgq., 





102 R.D. Gill 


=> K(s)=0. In this case + {n: T,>S,}<N(t)+k which has finite moments of all 
orders. 

We mention briefly some special cases. If K(t)=7;0,7(t) for some r.v. T 
which is conditionally independent of X,, J,,... given Jo, then Al and A2 
hold. If m=1, and K(t)=7,0,y,,)(L() for te(S,_,,5,], r=1,...,n and zero on 
(S,,, 00) for a fixed n=1 and positive r.v.’s U,,...,U, independent of X,, X,,... 
then all the assumptions hold and we have in fact the usual model of n 
censored observations of X,,..., X,, with censoring variables U,,..., U,,. 

Next we introduce processes which count observed sojourn times: for u=0 
we define 


N,(u)= #{n21:J,_,=i,J,=j,X, Su, K(S,)=13 
=number of sojourn times in state i observed to take on a 
value Su and to be followed by a jump to state j 
= Bro a AN, (t)=1, Z(t—)=i, L(t) Su, K(t)= 1}, 


N,(u) J=¥N, jf 
j 
For u>0 we define 


Y,(u) = # {n21:J,_,=i,X,2u, K(S,_,+u)=1} 
=number of sojourn times in state i observed to take on a value 2u 
= #{t: Z(t—)=i, L(t)=u, K(t)=1}, 

Y,(0)= Y,(0+), 


Y(u)=> Yu) (also for u=0). (13) 


Note that Y(0)= + {n: T,>S,}. 

By Al and A3, &Y,(u) and &N,(u)S& Y(0)<oo for all i, j and u. The 
sample paths of the processes N,, are almost surely zero at time zero, right 
continuous with left hand limits, nondecreasing and integer valued; while the 
sample paths of Y, are nonincreasing, left continuous with right hand limits, 
and nonnegative integer valued. According to the interpretation of K, these 
processes are observable. 

By the counting process results of Sect. 1 applied to {N, 


;i,j<m}, if we 
define 


ij? 


dQ; (L(s)) 
1—H,(L(s)—) 


Aij()=]xu(Z(s—)) 


Nqt-) Xr qd Li gq 
~/ Xo(J,~ vs resis u) Q; ;(u) 


+ XSi ) Ly (14) 


i, 
alah: (15) 


—H,(u-) 
and 


it follows that with respect to {¥,} (defined in A2) the M,/s are local square 
integrable martingales, zero at time zero, with 
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<M;;,M,,>=J(1— 4 4,) 4 Aj; 
<M; M;;.>= —fA A,,d A;;-, j+j 
{M;;,M;;>=0 i+’. (16) 
Here the localizing stopping times may be taken as constants by finiteness of 
& N,;(t) for all i, j and t. 
Now by A3, it follows that for all i and j 


8 [ K(s)dN,(s)S8 Y()<ov. (17) 


0 


But because K is predictable, nonnegative and bounded, by the stochastic 
integral results of Sect. 1, 


& | K(s)dN,(s)=€)K(s)dA,,(s) for all t<oo. (18) 
0 
So letting t> 00, , 
& | K(s)dA,,(s)<. (19) 
0 


Next, since 


by letting tT 00, we find 


t 2 
limé (|KidM.(s) <0. (21) 
0 


tfo 


But since | Kd M,, is a martingale w.r.t. {F,} this shows that | Kd M,, is in fact 
a (zero mean) square integrable martingale, clearly with 


<JKdM;;,j)KdM,;)>=|Kd<M,;,M,;>). (22) 
Next, as in Gill [6, Lemma 3 and Proposition 4] (where similar results are 
obtained in a rather special case), we find that for any bounded measurable 


functions f and f’ on [0, 0) 
f(u)dN; (u)= ) f (L(t) aN; ,(0), (23) 
0 


a) do.. oa 
ff Yu) 2 (=f PL) KAA, (0), (24) 
0 0 


1-—H,(u- 


and so defining 
r dQ,, 
Z; (v)=N, ;(v)—J Y,(u) 5 Q; ;(u) 
0 


—H,(u-) 
Z(v) =) Z;;(0), 
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we see that 


{ F(L(S)) K() dM, (6)=f fwdZ.(u, (26) 


where f(L) is a bounded predictable process (L is left continuous with right 


hand limits and adapted). So by the theory of stochastic integrals, taking 
expectations in (26), 


0=68 [ fwaZ,w)=f FMdsN,(w)—§ FEY) _ ay 
0 0 0 1—H,(u-) 


and similarly (by (9) and (26)) 


6 ((j ronaz,co) ({ seaz,p(0)) 


=& | f(L(s)) f'(L(s)) K(s)d <M; ;, Mj ;(s)> 
0 


o _ _4Q;;(u) aQ; ;(u) 
J swf (u) & Y,(u) (1 ee) 1—H,(u-)’ 
re rr) ; AQ; (u) dQ; j(u) 
= —{ swf O60) s a—| tae 
0, 








(where the last step follows by similar equalities to (24)). 


3. Estimation of Transition Probabilities Q, , 


Suppose we are given n independent identically distributed observations of N,; 
and Y,. Let Nj, Y;", N?, Zj;, Zj denote the sums of the n realizations of 
N;j> Y;, ... (see definitions (12), (13) and (25)). We build up estimators of Q;, in 
two steps, first estimating H; by H?, defined by 


= ANj(s) 
An(t)=1- (1 -——s ), 29 
where by convention (which we adopt from now on) 0/0=0. An equivalent 
implicit definition is 
: - dN"(s) 
An(t)=((1-A%(s—)) ——. 30 
()={(- ANS) ary (30) 
Now we can define 7, by 


An =~ _ ae d N7';(s) 
gi(=S H;(s—)) ¥,"(s) . 
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Adding over j we see that At=> 07; (cf. (4)). These definitions can be 


j 
motivated by the following facts. Putting in (27) f =%0,,, we find 
t 
EN; ;(t)=J 6 Y;(s)(1—H,(s—))~ * dQ;,(s) 
0 


which implies, adding over j, that 
8 Ni(t)=§ 6 Y,(s)(1—H,(s—))“* dH,(s). 
0 


So if t satisfies & Y,(t)>0, which implies that & Y,(s)>0 and 1—H,(s—)>0 on 
(0, t], we find 
dé N,(s) 


t 
A,(t)=\(1-—H,(s— 
i(t) J i(S—)) 6 Y(s) (32) 
(cf. (30)). We shall see in Lemma 1 that such an expression defines H; uniquely. 
Similarly, also supposing & Y,(t)>0, we find 


i ws * dé N; ,(s) 
2,9=SC H,(s \~BY(s) (33) 


(cf. (31)). Of course we can’t expect to be able to estimate Q;, outside of 
{t: & Y,(t)>0}. 
As a referee remarked, one might also be interested in the analogue of the 
well known “Nelson plot” (see e.g. Aalen [1, Sect. 6.1]), namely 
t 


t 
J ¥;"(s)~* dNjj(s), which estimates the function { (1—H,(s—))~* dQ; ;(s). Its pro- 
0 0 


perties can be derived in exactly the same way as those of Q, j and H, (in fact 
many of the arguments become easier). However we shall not go into the 
details here. 


We first prove consistency, for which we need two lemmas. 


Lemma 1. Let A and B be right continuous nondecreasing functions on [0, 0), 


zero at time zero; suppose AA<1 and AB<1 on [0, 0). Then the unique 
solution Z of 


' Z(s—) 
Z(t)=1-—| ——_—— (dA(s)—dB 
()=1-{ 7 FRG Als) 4B) (34) 
which is locally bounded (i.e. bounded on [0,t] for each t<oo) and right 
continuous with left hand limits is given by 


[] (1-4A(s)) - exp(—A,() 
Z()=*= 
" [] ( - 4B(s)) - exp(—B,(0) 


st 





where A,(t)=A(t)— }, 4A(s), ete. 


st 
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Proof. The proof given by Liptser and Shiryaev [10, Lemma 18.8] for the case 
B=0 goes through exactly, replacing their a with (1—4B)~', A with A—B, 
Z(0) with 1, and « with {(1—4B)-'(dA+dB). 


Corollary. For all t such that 1—H,(t)>0 


AH, ‘ dH, 
1—H,(t)= I (1 “7a es) exp (-J ya a 5) (36) 


and 
1- AR) _, ‘ 1—A"(s—) i AH (s) y dN(s)___ dH,(s) 
1-H,(t) _ ye en, 1—H,(s—) ox 1—H,(s—))° 








(37) 


Proof. Since (34) holds with B=0, A={(1—H,_)~'dH, and Z=1—H, we can 
substitute these quantities in (35) which gives (36). Thus by (22) and (29), (35) 
holds with A=f(Y")-'dN?, B=|(1—-H,_)~'dH,, and Z=(1 —HA")/(1—H)); 
substituting them in (34) gives (37). 0 


Note that if Y,"(t)>0, if 1—H,(t—)>0 and if 1—H,(t)=0 then almost surely 
A"(t)=H,(t)=1; also note that 1— 4H,/(1—H,_)=(1—H)/(1— H,_). So we can 
rewrite (37) as 
A" n dz" 


At —H,=(1- H)§— = “a, YP on 


(38) 


on {t: Y;"(t)>0 and 1—H,(t—)>0}, where Z; was defined in (25) and the 
convention 0/0=0 may have to be invoked. Next we write, also on {t: Y,"(t)>0 
and 1—H,(t—)>0} 


6:,-0,,=f-At) Ge - “Ou ) 5 ( = 


y"  1—d, —H,_ 


n dZi; _1-A"_ n dz? 
“10-B)y Ri 1-H, Y? ) au 


n n 


n fa n dZ} 


“t-F Js —H, Ye n 


4_Q;,J- 
“i n dZ? 

where we have used (31), (37) and integration by parts. (Adding over j and 

integrating by parts gives (38) again of course!) [ 


The next lemma and its corollary give conditions under which expressions 
such as the right hand sides of (38) and (39) converge uniformly in t to zero, in 
probability: this will be the consistency result of Theorem 1. 


Lemma 2. Suppose H,, and Z, are processes on [0,1] whose sample paths are 

almost surely right continuous with left hand limits, of bounded variation, and 

satisfying Z,(0)=0; n=1,2,.... Suppose  ||Z,||= sup |Z,()|7p0. If 
te[0, t] 
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J |dH,(0)| is bounded in probability as noo, then \\{H,_dZ,\|+p0 and 


(0, t) 


f- H,,dZ,,\| >p9; if ) |dH,,(t)| is.bounded in probability, then \\f H,4Z,,| — p90. 
(0, t] 


Proof. We can write 


({H,_4Z,)(t)= J (Z,(t)—Z,(s))dH,(s) 
(0,t) 


(JH,dZ,)()= J (Z,()-Z,(s—))dH,(s). 
(0,t] 


So ||fH,_dZ,|| and |\{- H,dZ,||<2||Z,l| | |dH,(t)|+p0 as noo and simi- 
(0, t) 
larly |\f{H,dZ,||>p0. 0 


Corollary. Suppose Lemma 2 allows us to conclude that fH? dZ,|| +9, i 
=1,...,r, and ||{ H®_ dZ,|| >p0, i=r+1,...,s. Then 


[sflae- 1, we az,|-.0, 
i=1 i=r+1 


as nO. 


Proof. Apply Lemma 2 first to H‘” and Z,, then to H? and fH‘ dZ,, etc. 0 


Theorem 1. Let t;=sup {t: & Y,(t)>0}. Then as n> 
sup |A?(t)—H,(t)|>p0 

te[0, tr] 
and 

sup |Q7,(t)—Q; ,(t)|>p9, 

te[0, ti] 
unless & Y,(t,))=0 and AH,(t,) or AQ; ;(t;)>0, in which case [0,t,] must be 
replaced by [0, t;) in the corresponding supremum. 


Proof. By the weak law of large numbers and monotonicity arguments it is 
easy to show that ||n~' N’;—@N; ||, +p 0 and 


\fn-* ¥"(1—H,_)~*dQ,,—J € ¥,(1—H_)~* dQ, ||, +p 9. 


So ||n~'Z?,\|,,—+p0 (using (27) with f=y,,)). Suppose tSt; is such that 
&(Y,(t))>0. Then ) |\d(n/Y,", (t)| Sn/Y;"(t), bounded in probability as no, 


(0, t) 
and so by the corollary to Lemma 2 and (39) 0? ,—Q; jl, >p0 as n> 00; adding 
over j shows ||H?—H,||,> p90. If & Y,(t;)>0 we are ready. Otherwise monoto- 
nicity arguments show that the required results hold with [0,1;) in place of 
[0,7]; if 4H,(t)=0 or AQ,,(t))=0 then adding t; to the range of the 
supremum in the corresponding term changes nothing. (] 


Actually almost sure convergence in the appropriate set up is also easy to 
derive using the strong law of large numbers and Glivenko-Cantelli type 
arguments, and a suitable modification of Lemma 2. 
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Now we turn to proving weak convergence, giving first a result for the case 
that Q;;'s give weight only to the positive integers. Again we use the repre- 
sentations (38) and (39), and also need Theorem 1. 


Theorem 2. Suppose X ,, X 2, ... take values in N. Let t;=sup {te N: 6 Y;(t)>0} 
(t;€ NU{}). Then {n*(Q?(t)—Q, ;(0), n*(A?(t)—H,(t)); teIN, tSt;, i,j<m} is 
distributed asymptotically as 


~ 1—H,(s—) ‘= (Q; (t)—Q; (s)) 1— H,(s—) 
13, By) Vu ays) «1-H 
‘ 1-H(s—) 1 


(-HO) & T—H@ FY) 





U;(s), 





U;(s); teN, tSt,, iism|. 


where the U; ,(s)’s are multivariate normally distributed r.v.’s with expectations 
zero; U;(s)=>. U; ;(s); and 
j 
i 4) AQ; ,(s) ) AQ; ;(s) 
var (U; ,(s))=& Y;(s) (1 i—H,s—)) 1-H,(s—)’ 
AQ; (s) AQ; ;-(s) 

1—H,(s—) 1—H,(s—) 
cov(U; ;(s), U;-;(s'))=0 i+ or s+s’. 





cov (U; ;(s), U; ;-(s))= — & Y;(s) 





j*/, 


Proof. Multiply (38) and (39) by n? and rewrite the integrals in the right hand 
sides as sums over s<t or s<t as appropriate, with n~*dZ?,(s) replaced with 
n~* AZ? (s). By Theorem 1 and convergence in probability of n~' Y,"(s) the 
coefficients of n~* 4Z?;(s) all converge in probability, while by the central limit 
theorem and (27) and (28) with f= 7,., f’=X;55 


{n-* AZi,(s); i,jsm, se}, {U,,(s); i,jm, seN} 
and the theorem is proved. {J 


Apart from the fact that the final theorem on weak convergence with 
arbitrary Q;, uses A3*, it includes Theorem 2 as a special case. The method of 
proof is essentially the same, though many more technical details are en- 
countered. 


Theorem 3. Suppose A3* holds, and choose t;, im, such that € Y,(t;)>0. Then 
considered as a random element of [|(D[0, t;])"** (see Billingsley [3]) {n*(Q,, 


—Q, ), n*(A,—H)); i, j<m} is asymptotically distributed as 


{yee 1-H. 1 gy, 1-H, 1. 


ay, Mi Ou a ay tT Oumar ay 


Y, 
a 

1-H, i- __4dW,; i,j<m 

ila 1-H, ¢@Y, ” \ 
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where the W,;s are jointly zero mean Gaussian processes with independent 
multivariate increments, the sets {W,,; j<m}, i=1,...,m, being independent of 
one another; W,=)). W,,; and 

j 


ij? 





var (WW (0)={ 8 Y,(s) (1 ws AQ; (s) dQ; ,(s) 


1—H,(s—)) 1—H,(s—)’ 
AQ; ,(s) dQ; +15) 
—H,(s—) 1—H,(s—) 





cov (W, (0), Wis(0)= —§ 8 Ys) 5 
0 


The integrals with respect to W,, and W, are stochastic integrals in the sense of 
Meyer [11] (the W,;s are square integrable martingales with respect to the 
natural family of o-algebras) or can equivalently be defined by formal in- 
tegration by parts, the resulting expressions having a pathwise definition. 


Proof. We only sketch the proof here; details are given as three lemmas in 
Sect. 5. The proof is again based on the representations (38) and (39). Multiply- 
ing these equations by n+, we note that the expressions on the right hand sides 
consist of integrals, where the integrands are products of fixed functions and 
the processes 1—A"_ and n/Y,", which, in probability, converge uniformly on 
[0, t,] to 1—H,_ and (€ Y)~*. The integrals are taken with respect to n-*Z7; 
and n-*Zi=) n-*Z},. Now these processes have finite dimensional distri- 


J 
butions which converge to those of W,; and W, by using the central limit 
theorem and (27) and (28) with f=7;9 ,, and f’=7;,9,,;- In Lemma 3 in Sect. 5 
we prove tightness of n-*Zi, in D[0, t;], here A3* is used. Actually we need to 
prove a little more, because in the next step of the proof we want to apply the 
Skorohod-Dudley theorem (see e.g. Pyke [13]) and consider processes (H” 
—H,, (n~' Y¥'—8&Y,j, and (n-*Z?,) (i,jm, n=1,2,...) defined on a new 
probability space with the same joint distribution for each n as their unprimed 
equivalents, and converging almost surely in the supremum norm to 0, 0, and 
W;'; tespectively (the W,'’s also having the same joint distribution as the W, ;s). 
The construction is possible with the supremum norm distance rather than the 
Skorohod distance, if the sample paths of each W,, can be taken to be 
continuous with probability one; i.e. if the Q; 7s are continuous. However, we 
can get round this problem by inserting a time interval at each jump point t of 
Q;;, joining up n-*Z7,(t—) to n~*Z?,(t) with a straight line across this 
interval, and proving joint weak convergence of these new processes on the 
resulting extended time interval to the corresponding objects obtained from 
the W,,’s, which can be taken to be continuous. Of course we shall need weak 
convergence of the finite dimensional distributions of n~*+Z7,_ and n~*Z7;; 
which again follows from the central limit theorem and (27) and (28). In 
Lemma 4 we show that the above programme can indeed be carried out. Now 
we are at liberty to apply the Skorohod-Dudley theorem (after which we 
remove the extra intervals again). Finally the corollary to Lemma 5 shows that 
the suprema over [0, t] of the absolute difference between the primed versions 
of n? times (38) and (39) and their “obvious” limits (given in the statement of 
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the theorem above) converge almost surely to zero, where some care is needed 
because these obvious limits cannot be defined directly as pathwise integrals if 


the Q;;s have continuous components. This problem is also resolved in 
Lemma 5. 0 


4. Remarks and Generalizations 


Though the limiting covariances of n* (0? ;—Q, j) can be consistently estimated, 
it seems difficult to use them to construct confidence bands. At least this is not 
the case for n?(H"—H,); e.g. Gill [6 or 7]. An advantage of the Nelson plot 
mentioned earlier (immediately before Lemma 1) is that confidence bands for 
them are easy to construct, because the corresponding asymptotic process will 
have independent increments; i.e. we will obtain a deterministically time 
transformed Brownian motion. Also Theorem 3 does not give us in general a 
limiting distribution of the estimators of the transition probabilities Q; (00) of 
the Markov chain Jo,J,,... associated with the Markov renewal process, 
supposing that estimation should be possible (Theorem 1 gives conditions for 
consistent estimation). 

Theorem 3 specialized to the case m=1 and assuming a rather special form 
for the censoring process K gives us a weak convergence theorem for the so- 
called product limit estimator. This result generalizes that obtained by Breslow 
and Crowley [4], for which continuous distributions of both censoring vari- 
ables and the variables of interest were assumed. In Gill [7] we derive this 
result in a more direct fashion and in more generality. 

We next discuss possible generalizations of our results. Firstly, can we drop 
the restriction Al to right censorship? In the discrete case this gives no 
problems: we must assume that @ Y,(s)<0o for all i and s=1, 2, ... and realize 
that we can only estimate 4Q; ,(s)/(1 — H;(s—)) for s such that & Y;(s)>0 (where 
Y, is still defined by (13)). However in the general case we have made strong 
use of many of the properties of the processes Y, as the following list of 
“corrections” shows: replace Al with the assumption that the sample paths of 
Y, are left continuous with right hand limits; in A2 instead of the stopping 
time condition assume that K is predictable with respect to the given o- 
algebras; and in A3 and A3* replace #{n: T,>S,} with +{n: K(s)=1 for 
some sé(S,,5S,,,]}. Theorem 1 then remains true if we replace t; with 1; 
=sup {t: & of |dY,,(s)|<oo and & min, ,, ¥,(s)>0} and modify the con- 


dition ‘ susie: & Y,(t;)=0” accordingly. Theorem 3 remains valid if we choose 
t; such that & a ws, ,(Di|<o0, & mia Y,(s)>0, \ln~' ¥"—@Y,||,,>p0 and 
0, ti) 


) \dé Y, , (t)| be <0, eit actually the last two properties are a consequence 
(0, ti) 


of the first one. 
Alternatively, what happens when we drop the assumption of identically 
distributed + saps of {N;;, Y¥,}? Consider a triangular array {Ni + ue 
i, jm}, k=1,. ;n=1,2,..., for each (k, n) defined as in Sects. 1 and 2 with 
fixed Q; ;s, but pane differing initial distributions and “censoring distri- 
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butions”, and independent over k for each n. Define 


Ni'= > Nii 


k=1 


Yi= > a 
k=1 


and proceed as in Sect. 3. It is not too difficult to see that if we replace A3 with 
the assumption that for some fixed C, & ¥Y*"(0)<C<oo for all k and n and 
similarly modify A3*, and define & Y, to be lim n~' & Y,", which we assume to 
exist and to be left continuous with right hand limits, then Theorems 1 to 3 
still hold. Important for applications to medical trials is the fact that even if for 
some states i the transition probabilities vary with n and k, for the other states 
all our results go through. 

Finally, instead of looking at n independent observations, suppose that a 
single Markov renewal process is, with censoring, observed over an expanding 
sequence of time intervals. As a specific example, consider the model of Sects. 
1 and 2 where we drop assumptions Al and A3 but do suppose that the X;’s 
take values in N. In A2 we assume that K is predictable instead of making the 
stopping times assumption. Defining 


Y;"(u)= #{tgn: Z(t—)=i, L(t)=u, K(t)=1} 
and 
AN} (u)= #{tSn: AN, (t)=1, Z(t—)=i, L()=u, K(t)=1} 


(cf. (12) and (13)) it is possible, as in Bather [2, Lemma 1], to apply Chow [5, 
Theorem 5] to show that on the set of w for which Y,"(u)— 00 as noo (for 
fixed i and u), A N?,(u)/Y;"(u)> 4Q, (u)/(1 — H,(u—)) a.s. It is not yet clear to the 
author what can be done for general Q; ;'s, nor indeed in this special case how 
weak convergence can be proved. 

It should be pointed out that we could not apply the general theory of 
Aalen [1] to derive our results, despite the strong similarity of models. This is 
because of the occurrence of the process L in formula (14), whose saw-tooth 
paths prevent the transformation (26) from M, , to Z; ; (defined in (15) and (25)) 
from preserving the martingale property of M;;. However, it preserves enough 
of it (the properties of first and second moments) for our asymptotic results. 


5. Technical Lemmas Needed for Proof of Theorem 3 


Lemma 3. For each i, j and t such that & Y,(t)>9, n-*Zi; is tight in D[0,t] as 
noo if A3* holds. 


Proof. Let I, and I, be the intervals (t,, t] and (t, t,] for some time instants 


OSt, <t<t,St. Write A,X for f dX, k=1 or 2, for a process or function X of 
Ik 
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bounded variation. We show that we can find C>0 and «e(4, 1) such that 
n~?6(A,Zi,- 4,Z}))? $C(4,Q;;- 429; )*, (40) 


which proves tightness in view of Billingsley [3, Theorem 15.6 and the remarks 
on p. 133]. Now (4, Z/;, 4,Z7,) is a sum of n independent random variables 
with zero means (by (27) with f = ,,) each distributed as (4, Z;;, 4,Z;;). So 
n~*8(A,Z},-4,Z3,)? =n-' 6(A,Z;;- 4,Z;;) 
+((n—1)/n) 8(4, Z;)° €(4,Z;,;) 
+2((n—1)/n)(6(4,Z;;-4,Z;;))’. 
Replacing f with y,, and f’ with y,, in (28) shows that 


while replacing f and f’ by x,, shows 





= des ij ij 
6(4,2,)°=§ 6, (1-3 iL); H 


<CA,Q;; because (1—H,_)~! is bounded on [0, t] 
SC(4,Q;))* for any «e€(0, 1). (43) 


Note that in the sequel the constant C may be different on each appearance; 

however, it can always be chosen not to depend on J, and J, though it will 

often depend on «a. Substituting (42) and (43) back in (41), we see that to 

establish (40) it remains to suitably bound &(4, Z;;- 4,Z;;)’. Now since 4,Z;; 

=A,N,;— J ¥(1-H;_)~'dQ 
Tk 


ip? 
4,24; 422A S4iNy 42Nj+( Yl Hi) "dO, (J Y¥(1—H,_)~*dQ;,) 
+4,Nij(J Y(1- Hy) *d0,)+ ANI ¥,(1—H;_)~'dQ;,) 
<cY(?, (44) 
while expanding (42) we find that for «, =1—f, €(0, 1) 


6(A,Nj-4.N,)+6(J ¥,—H,_)-*dQ,)-(f ¥(1—H,_)-*4Q,,))) 
I; 12 


=&(4,N,;-(f ¥(1—H,_)~'dQ,))+6(4,N,;-(J Y,(1—H,_)~*dQ,,)) 
I2 Ii 


SC-&(4, Ni;" Y(0))-4,Q;;+ C-&(4,N;;- Y(0))- A, 0Q;; 
SC-[6((4,N,)*- ¥()'*")-(4,.0,)" 

+ &((4, N,)*- ¥(0)'*")-(4, Q;)"] 
S C[(E(4, N,))*(6 ¥(0)' * "(4 Q,)" 

+(8(4, N,,))"*(6 ¥(0)'*")"*(4, ,)"] 
SC(4,Q;;-4,9;)" 
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if 
&(¥(0))'* 9") <0, 
where we have used Hdlder’s inequality and the fact that 


6A,N,,= | €Y,1—H,_)~'dQ;.. 
ij i i ij 


Therefore for «,=1—f,€(0, 1), 
6(A,Z;;-4,Z;;)? =E(\A, Z,;- 42 Z,j17 +14, Z;j;- 42 Z,j|' +”) 
$C-&(\A,Z;;-4,Z;;\ Y(0)?*7*) (by (44)) 
SC(6 14, ZA, Z,)"(EYOP*7) (47) 
(by Hélder’s inequality) 
$C(4,Q;;-429;)" 
by (44) and (45) if (46) holds and if 
& (Y¥ (0)? * 7/82) < a, (48) 


Now by A3* we can choose B, <4 such that (46) holds and £,<? such that 
(48) holds. For such a choice «, «,=(1—B,)(1—B,)>2-3=4 and therefore, by 
(47), for sufficiently small «>4 we can find a C<oo such that 


6(A4,Z,;-4,Z,)? $C(4,9,;-42;)* for all t,, t, t,, (49) 
which completes the proof. { 


We have now shown that for 1; satisfying & Y,(t;)>0, 


{n-*Zi;i,jsm}—>g{W,;;i,jgm} on []D[O0,7,]" 
i=1 


where the limit has been defined in the statement of Theorem 3, while jointly 
H"—H,>,0 and 0?,-Q;;> 90 each on D[0,t;]. The next lemma on the 
Skorohod-Dudley construction is only stated and proved for a single process 
n-* Zi, but the required simultaneous result can obviously be proved in the 
same way. 


Lemma 4. A Skorohod-Dudley construction is possible for n-* Zi, n=1,2,... 


with respect to the supremum norm on D[0,t;] where t; satisfies & Y;(t;)>0. 


Proof. Write W,=n-*Zi,, n=1,2,...; W=W,,; and t=7,. Let t,,t,,... be an 
enumeration of the jump points of Q;; in [0,t] and let 6,=4Q,,(t,)20 for all k, 
> 6, <1. Define v(t)=t+ Y 5, and 6(t)=v(t)—v(t—)=6, if t=t, for some k 
k th St 

=1,2,... and =0 otherwise. By the comments above, W,>,W as noo in 


D[0,t] where W is a zero mean Gaussian process with independent increments 
t 
and variance function var(W(t))={ f(s)d Q;(s)= A(t) say, for some bounded 


0 
non-negative measurable function f on [0,t]. Let W* be the zero mean 
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Gaussian process on [0, v(t)] with independent increments and almost surely 
continuous sample paths, such that var(W*(u))= A(t—)+ f (t)(u—v(t—)) where 
t is the unique solution of v(t—)<Su<v(t). (W* exists with these properties 
because var(W*(u)) is a continuous nondecreasing function of u.) 

Now define, for ue[0, v(t)], n=1, 2, ... and m=1, 2, ..., 0, 
W,(t) if u=v(t) for some te[0, t] 
W,(t,—) if v(t,—)Su<v(t,) for some k>m 

Oy 


if v(t,—)Su<v(t,) for some kSm 


W,(t,—)+ - (W,(t,) — W(t, —)) 





and define, also for ue[0, v(t)] and m=1, 2, ..., 00, 
W*(v(t)) if u=v(t) for some te[0, t] 
W*(v(t,—) if v(t,—)Su<v(t,) for some k>m 


W*(v(t,-) + 
k 


if v(t, —)Su<v(t,) for some k<m. 


(W*(v(t,))— W*(v(t,—)) 





In words, for n=1, 2, ..., W,*,, is obtained from W, by inserting time intervals of 
length 6, at ¢, and joining W,(t,—) to W,(t,) by a straight line across this 
interval if k<m, but continuing a horizontal line from W,(t,—) if k>m; while 
W;,, is obtained from W* in a similar way except that W* has already been 
defined on the extended time interval. We see from this construction that W¥,, 
is a random element of D[0, v(t)] for m<oo while W* ,, is a random element 
of C0, v(rz)]. 

Now fixing m<oo for the moment, we can prove that W,*,>gWI ,, as 
n— co in D[O, v(z)]. For the convergence of the finite dimensional distributions 
is again straightforward. Tightness is proved by proving tightness on each of 
the 2m+1 intervals [0, v(s, —)], [v(s, —), v(s,)], ---, [v(S,), ¥(t)] where s,,...,5,, 
is t,,...,¢,, put into increasing order. Tightness on an interval of the form 
[v(s,—), v(s,)] follows from convergence in distribution of (W,(s,—), W,(s,)) as 
n— oo, while tightness on [v(s,_ ,), v(s,—)] follows from tightness of W, (rede- 
fined in the points s, as W,(s,—)) on [s,_,,5,] and the observation that the 
modulus of continuity w’‘(6) is smaller for W,*, on [v(s,_ ,), v(s,—)] than it is 
for W, on [s,_ ;,5,]. 


Next we show that &||W*,—W,*,||2,,20 as m—co uniformly in n 
=1,2,...,00. For any n 


1W2.— We oka= SUP (Wy ((ty)) — Watao(M(t,—))) 
<2 (W,*0 (V(t) — W,*.. (V(t, —)))’, 


which implies that 


é ! We we \ sah y Ff (ty) 6,70 
k>m 





Estimation for a Censored Markov Renewal Process 115 


as m>oco uniformly in n=1,2,...,00. Since the Skorohod do-distance on 
D[0, v(t)] is smaller than the supremum norm distance, we have now shown 
that in D[0, v(t)] 

Win gWem as noo for m=1,2,... 

WE n7>gWs . as moo 


2) 


lim limsup P(d,( 


m~ oo no 


W,*,,W*,)2e)=0 for all e>0. 


So by Billingsley [3, Theorem 4.2] 
We.7gWs. as no. 


Since W* ,, has almost surely continuous sample paths we can apply the 
Skorohod- Dudley theorem; and going back to the interval [0,1] we have 
finally constructed W,;, n=1,2,... and W’ having the same marginal distri- 
butions as W, and Ww but now “defined on a single probability space and 
satisfying ||W,;—W’||.-0 almost surely asn>oo. QJ 


Lemma 5. Let H,, and Z,, n=1,2,... and Z be random elements of D[0,t] 
defined on a single probability space (Q,¥, P) and such that with probability 1, 
H,, and Z,, have paths of bounded variation and Z,(0)=0 for each n=1,2,...,(*) 
limsup { |dH,(s)|<0, |Z,—Z\|+0, and \|H,—h\|+0 where h is a fixed 


n—>o (0,1) 


function of bounded variation on [0,t]. Then \{H,_4Z,—\h_dZ\|-0 and 


IH, dZ,, —fhdZ\|0 almost surely, where fhdZ is defined as hZ — §Z_ dh and 
fh_dZ as hZ—|Zdh (because Z does not necessarily have paths of bounded 
variation). In fact to conclude |fH,_4Z,—jh_dZ\||0 or lf" H,4Z,, 
—|~hdZ\-0 almost surely we can weaken («) to limsup J |dH,(s)|<0. 


no (0,1) 
If furthermore stochastic integrals h_oZ and hoZ (Meyer [11, definition 
18]) can be defined, then these coincide with fh_dZ and jhdZ, respectively. 


Proof. Fix an meQ not in the exceptional event of probability zero specified 
above, and denote by H,, Z, and Z the functions on [0,7]: H,(@), Z,(@) and 
Z(qw). Choose an e>0. Then there exists a Z* (= Z*(q,<&)) such that Z*(0)=0, 
Z* is of bounded variation, and ||Z—Z*||<e. Next we write 


fH, 4Z,—Jh_dZ\S\\fH,_4dZ,—JH,_dZ*| 
+ ||fH,_dZ*-—fh_dZ*\|+ \fh_dZ*—fh_dZ|| 
=A,+B,+C (say). 


Now using the formula j X(s—)d Y(s)= j (Y(t)— Y(s))d X(s) for functions 


(0,1) (0,1) 
X, Y in D[0,t] of bounded variation and with Y(0)=0, and the fact that 
limsup | |dH,(s)|<0o it is easy to show that 


no (0,1) 


limsupA,=c(1) as ¢|0 
no 


and that 
=o(l) as e190. 
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Finally B,<|\H,—h|\| | |dZ*(s)|+0 as no, so combining these relationships 


0,t 
we have the seelond semi: Similar arguments establish the other assertions of 
almost sure convergence in norm. 

Now we look at the second part of the lemma. By Meyer [11, IV n° 23], if 
h_ocZ and hoZ can be defined, hZ=h_oZ+Zch; by [11, IV n° 29], Zoh 
=|[Zdh. So hZ—SZdh=hoZ. Again by [11, IV n°38] hZ=heoZ+Zch 
where according to [11, IV n° 29] Z_ch=|Z_dh.SohZ-JZ_dh=hoZ. ] 


Corollary. Suppose Lemma 5 allows us to conclude that almost surely 
\fHOdZ,—fhdZ\>0,  i=1,...,r 
and 
\H® dZ,—fhdZ\|>0, i=r+1,...,s. 


s 


Then |\{[[H®- [] H® .dZ,-J[]h® [] h@dZ}-0 almost surely. 


i=1 i=r+1 i=1 i=r+1 


Proof. Apply Lemma 5 first to H‘ and Z,, then to H?) and JH“ dZ,, etc. 0] 
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